Mathematics:  A  Third  Level  Course 
The  Open  University 


Unit  10 

Epidemics


M343  APPLICATIONS  OF  PROBABILITY 




Prepared  by  Andris  Abakuks  (Birkbeck  College,  University  of  London) 
and  the  Course  Team 


CONTENTS

Introduction
1  Modelling the spread of an infectious disease
2  The simple epidemic
   2.1  Setting up the model
   2.2  The deterministic model
   2.3  The stochastic model
   2.4  The duration of the simple epidemic
3  The general epidemic
   3.1  Setting up the model
   3.2  The deterministic model
   3.3  The threshold phenomenon
   3.4  A variation on the model
4  The stochastic general epidemic model
   4.1  Setting up the model (Audio)
   4.2  The threshold phenomenon (Video)
5  Other models for epidemics
Objectives
Appendix: Solutions to questions



Statistics  tables 


The  recommended  book  of  statistics  tables  for  this  course  is  H.  R.  Neave, 
Elementary  Statistics  Tables  (George  Allen  &  Unwin,  1981).  In  this  unit,  these 
tables  are  referred  to  as  Neave. 


Unit  titles 


1  Random Processes              9  Queues
2  Events in Time               10  Epidemics
3  Patterns in Space            11  More Population Models
4  Branching Processes          12  Genetics
5  Random Walks                 13  Renewal Models
6  Markov Chains                14  Diffusion Processes
7  Birth Processes              15  Time Series
8  Birth and Death Processes    16  Problems, Problems, Problems

The  Open  University  Press,  Walton  Hall,  Milton  Keynes. 

First  published  1988.  Reprinted  1993,  1997. 

Copyright  ©  1988  The  Open  University 

All  rights  reserved.  No  part  of  this  publication  may  be  reproduced,  stored  in  a  retrieval 
system  or  transmitted  in  any  form  or  by  any  means,  without  written  permission  from  the 
publisher. 

Designed  by  the  Graphic  Design  Group  of  the  Open  University. 

Printed  in  the  United  Kingdom  by  Page  Bros,  Norwich. 

ISBN  0  335  14294  X 

This  text  forms  part  of  the  correspondence  element  of  an  Open  University  Third  Level 
Course. 

For  general  availability  of  supporting  material  referred  to  in  this  text,  please  write  to: 
Open University Educational Enterprises Limited, 12 Cofferidge Close, Stony Stratford,
Milton Keynes MK11 1BY, Great Britain.

Further  information  on  Open  University  courses  may  be  obtained  from:  The  Admissions 
Office,  The  Open  University,  P.O.  Box  48,  Milton  Keynes  MK7  6AB. 



Introduction 


Throughout  history,  epidemics  of  infectious  diseases  have  not  only  caused 
immeasurable  human  suffering,  but  have  often  played  a  decisive  role  in  the 
destruction  of  armies  and  of  empires.  They  have  exercised  a  profound  influence  on 
rates  of  population  growth.  Although  such  diseases  as  plague,  smallpox  and  typhus 
fever  are  no  longer  the  common  cause  of  death  that  they  once  were,  other  diseases 
have  remained,  or  have  become,  important  as  major  sources  of  illness.  For 
example,  in  the  tropics,  malaria  and  schistosomiasis  are  widespread  debilitating 
diseases.  Every  few  years,  there  occurs  a  worldwide  pandemic  of  influenza, 
facilitated  by  the  ever-increasing  amount  of  air  travel  between  the  cities  of  the 
world. 

The  control  of  infectious  diseases  is  still  a  major  concern  of  medical  science,  and 
mathematical  models  have  a  part  to  play  in  helping  to  understand  how  epidemics 
of  such  diseases  spread,  and  how  they  may  be  controlled.  In  this  unit,  we  shall  be 
examining  some  of  the  basic  epidemic  models. 

In  Section  1,  some  epidemiological  principles  relevant  to  the  construction  of 
mathematical  models  are  outlined.  For  instance,  to  construct  an  even  roughly 
competent  model,  it  is  necessary  to  know  something  of  the  way  in  which  diseases 
are  passed  from  one  individual  to  another — when  they  can  be  passed  on,  and 
when  not;  when  an  individual  may  be  regarded  as  being  immune  to  a  disease,  or 
as  having  recovered  from  it;  and  so  on.  In  Section  2,  the  so-called  simple  epidemic 
model  is  described.  This  model  applies  to  a  situation  where  individuals,  once  they 
have  caught  a  disease,  remain  for  ever  after  infectious  and  capable  of 
communicating  the  disease  to  anybody  who  has  not  caught  it.  A  particular 
consequence  in  a  closed  community  is  that  eventually  everybody  will  have  caught 
the disease. A question then of interest is, for example, how long that process
takes.

Sections  3  and  4  deal  respectively  with  deterministic  and  stochastic  formulations 
for  what  has  become  known  as  the  general  epidemic  model.  In  this  case,  infected 
individuals  do  not  remain  infectious  for  ever:  in  fact,  after  some  random  time,  they 
recover  from  the  disease  and  can  no  longer  pass  it  on.  They  remain  in  the 
community,  but  it  is  also  assumed  that  once  they  have  caught  the  disease  and  have 
recovered  from  it,  they  are  thereafter  immune  from  further  attacks.  Because  it  is 
possible  to  recover  from  the  disease  without  passing  it  on  to  anybody,  or  after 
passing  it  on  to  only  a  small  number  of  people,  one  possibility  with  the  general 
epidemic  model  which  distinguishes  it  from  the  simple  epidemic  model  is  that  the 
epidemic  may  die  out  before  every  member  of  the  community  has  been  afflicted 
with  the  disease.  Then  interest  centres  not  only  on  how  long  this  might  take,  but 
also  on  the  extent  to  which  the  community  is  affected.  Is  the  disease  more  likely  to 
affect  almost  everybody  than  it  is  to  disappear  from  the  community  almost 
unnoticed?  The  end  of  Section  4  would  be  a  good  point  to  watch  Band  F  of  the 
video-cassette. 

The  final  section  considers  very  briefly  some  situations  for  which  the  simple 
epidemic  and  general  epidemic  models  are  not  suitable. 

The  first  section  is  quite  short,  but  constitutes  essential  preparatory  reading  for  the 
rest  of  the  unit.  The  next  three  sections  are  all  about  the  same  length,  and  do  need 
to  be  studied  in  the  order  presented.  The  final  section  is  extremely  short. 

An  audio-cassette  session  occurs  in  Subsection  4.1;  this  is  about  the  total  number 
of  individuals  to  have  contracted  a  disease  over  the  entire  course  of  an  epidemic, 
and  the  number  who  have  escaped  unscathed. 

The  unit  can  be  regarded  as  the  fourth  of  the  sequence  which,  beginning  with  the 
simple  birth  process  of  Unit  7,  has  dealt  with  integer-valued  random  processes  in 
continuous  time.  It  is  just  another  context  for  a  model  of  the  general  birth  and 
death  type.  It  is  slightly  more  complicated,  as  in  the  case  of  the  general  epidemic 
model there are two populations to count, not one (those at risk from a disease,
and those who have contracted it), but the principles remain the same. More
emphasis than usual is placed on deterministic models, for even in the case of the
simple  epidemic  model,  usable  general  results  for  stochastic  models  are  rare. 
Throughout  the  unit,  therefore,  you  will  be  asked  to  perform  sometimes  fairly 
detailed  calculations  of  probabilities  (like  survivor  distributions)  by  hand.  For 
small  populations  this  is  not  too  time-consuming.  For  larger  populations  one 
would  use  a  computer.  (You  will  not  be  asked  to  do  this.) 

Before  you  study  this  unit,  you  should  be  familiar  with  the  material  of  Units  7 
and  8,  which  cover  the  principles  of  the  birth  and  death  process.  Do  not  worry  if 
you  do  not  have  the  material  of  Unit  9  quite  at  your  fingertips:  there,  as  here,  a 
specific  application  of  the  principles  is  described,  and  you  do  not  need  to  have 
studied  queues  to  find  out  about  epidemics. 




1  Modelling  the  spread  of  an  infectious  disease 


In  this  unit  we  shall  study  some  models  for  the  spread  of  infectious  (or,  as  they  are 
often  termed,  communicable)  diseases  transmitted  by  infected  individuals  to 
susceptible  individuals  in  a  community.  The  sorts  of  disease  to  which  the  models 
could  be  applied  are  such  viral  infections  as  the  common  cold,  influenza  and 
measles — and  the  now-eradicated  smallpox. 

Before  we  can  begin  the  task  of  constructing  suitable  mathematical  models,  it  is 
necessary  to  understand  the  stages  of  the  development  of  a  disease  after  a 
susceptible  individual  is  infected,  and  to  become  familiar  with  the  terminology  used 
by  epidemiologists. 

It  is  assumed  that  initially  at  least  one  person  is  infected  with  the  disease.  After  the 
infection  has  been  communicated  by  an  infected  individual  to  a  susceptible  one, 
there  is  to  begin  with  a  latent  period,  during  which  the  disease  develops  within  the 
newly-infected  individual:  but  he  (or  she)  is  not  yet  capable  of  passing  on  the 
infection  to  anybody  else.  This  latent  period  is  followed  by  an  infectious  period, 
during  which  the  infected  person,  or  infective,  is  capable  of  transmitting  the 
disease.  At  the  end  of  this  infectious  period,  the  infective  becomes  again  incapable 
of  passing  on  the  disease.  He  might  be  dead;  or,  if  recovered,  he  might  be  immune 
from  further  attacks  of  the  same  disease  (as  in  the  case  of  measles). 

When  the  individual  is  first  infected,  there  is  also  an  incubation  period,  and  this  is 
the  period  until  the  first  symptoms  of  the  disease  appear.  All  these  concepts  are 
illustrated  schematically  in  Figure  1.1.  The  incubation  period  is  generally  longer 
than  the  latent  period,  and  this  is  also  shown  on  the  diagram. 


Figure  1.1  Stages  in  the  infection  of  a 
susceptible  individual 


For  at  least  some  diseases,  in  some  communities,  when  symptoms  appear  at  the 
end  of  the  incubation  period  the  infected  individual  is  isolated  so  that  he  can  no 
longer  pass  on  the  disease  to  others.  Even  so,  there  remains  the  period  of  time 
from  the  end  of  the  latent  period  to  the  onset  of  symptoms,  during  which  the 
infectious  person  is  circulating  in  the  community.  Where  a  sufferer  is  not  isolated, 
he  may  continue  to  transmit  the  disease  until  the  end  of  the  infectious  period,  at 
which  time  he  either  recovers  or,  in  the  case  of  fatal  diseases,  dies.  After  recovery, 
he  may  be  at  least  temporarily  immune  to  the  disease.  The  term  removal  is  used  to 
refer  to  the  end  of  the  infectious  period,  and  may  cover  isolation,  recovery  or 
death.  In  any  case,  after  removal,  the  individual  is  assumed  to  play  no  further  part 
in  the  spread  of  the  disease.  The  salient  features  for  the  mathematical  modelling  of 
an  infectious  disease  are  illustrated  in  Figure  1.2. 




Figure  1.2  Stages  relevant  to  the  modelling  of 
an  infectious  disease 


An  epidemiologist  is  a  scientist 
concerned  with  the  control  of 
epidemic  diseases.  From  the  Greek 
epi: among + demos: people.


One  aspect  of  the  modelling  process  involves  specifying  the  lengths  of  the  latent 
period  and  of  the  period  during  which  an  individual  is  infectious  and  in 
circulation.  These  periods  may  be  taken  to  be  of  fixed  length  (in  some  cases  zero), 
or  to  have  lengths  specified  by  some  probability  distribution.  In  the  latter  case,  for 
the  models  with  which  we  shall  be  dealing  in  this  course,  the  exponential 
distribution is used. Where the latent period is taken to have zero length, then
only  three  classes  of  individuals  need  to  be  considered  (that  is,  counted) — the 
susceptibles  (those  liable  to  the  disease  if  they  meet  an  infected  person),  the 
infectives  (those  capable  of  passing  on  the  disease)  and  those  removed  (those  who 
have  had  the  disease  but  who  are  no  longer  capable  of  passing  it  on).  Any 
individual  who  is  eventually  infected  passes  through  these  three  stages: 

SUSCEPTIBLE → INFECTIVE → REMOVED.

These  are  the  technical  terms  which  will  always  be  used. 

In  constructing  the  models,  assumptions  also  have  to  be  made  about  the 
community  in  which  the  spread  of  a  disease  is  being  modelled.  There  is  the  size  of 
the  community  to  be  considered.  At  one  extreme,  we  might  be  interested  in  the 
spread of the disease in a small household: will you catch your spouse’s cold
(hope not); or will the quin with mumps pass it on to the other four (hope so)?
Other possibilities include a school and a village. At the other extreme, we might
be  concerned  with  a  disease  spreading  through  a  population  consisting  of  millions 
of  individuals. 

In  other  units  of  this  course,  populations  have  been  modelled  which  have  changed 
size  as  individuals  are  born,  or  die;  as  others  arrive  and  as  others  depart.  Of 
course,  such  things  as  birth,  death  and  migration  are  bound  to  occur  in 
communities where there is disease. However, in the epidemic models that we shall
consider  in  this  course,  the  population  will  be  assumed  to  be  closed.  By  this  is 
meant  that  during  the  course  of  the  epidemic,  no  individual  either  leaves  the 
community  or  joins  it  from  outside,  and  no  births  take  place.  In  the  nature  of 
things,  people  might  die,  but  for  the  purposes  of  accounting,  removed  individuals 
are  regarded  as  still  belonging  to  the  population.  As  a  consequence,  the  total  size 
of  the  population  remains  constant. 

Example  1.1 

Here  are  three  typical  scenarios.  In  each  case  a  disease  is  spreading  through  a 
community  whose  total  size  is  known.  The  disease  is,  to  some  extent,  monitored: 
at  various  stages  during  the  course  of  the  epidemic  it  is  known  how  many  are 
suffering,  how  many  have  recovered  (if  any),  and  how  many  might  yet  catch  the 
disease. 

(i)  In a family of six (two parents and four children), one of the children catches a
‘bug’. Next day, both parents and one of the other children are also down
with  the  same  ailment.  Two  days  later,  they  all  have  it. 

(ii)  A  small  school  comprises  30  young  children.  On  Sunday  (unknown  to  parents 
and  teachers)  one  of  them  catches  an  infectious  disease,  and  takes  it  to  school 
on  Monday.  On  Tuesday,  7  children  are  ill  with  the  disease,  and  are  infectious 
(and  the  other  23  are  well,  but  susceptible).  On  Wednesday,  3  have  recovered 
(removed),  18  are  ill  (infectious)  and  9  have  yet  to  catch  the  disease 
(susceptible). By Friday there are 13 ill children; all the remaining 17 children
have  had  the  disease  but  have  recovered.  By  the  next  Monday,  all  30  children 
are  well  again. 

(iii)  Plague  breaks  out  in  a  small  peasant  hamlet  where  there  are  a  hundred 
inhabitants.  A  month  later  all  traces  of  the  disease  have  disappeared,  but  it 
has  left  65  dead.  Eight  members  of  the  community  caught  the  disease  and 
managed  to  survive — it  is  assumed  that  they  would  be  immune  from  the 
disease  were  it  to  strike  again.  The  remaining  twenty-seven  were  free  from 
infection  throughout  the  attack.  □ 

One  thing  that  the  models  we  shall  be  studying  do  not  set  out  to  do  is  to  keep 
track  of  who  is  at  what  stage  of  an  infection.  It  would  be  very  complicated  indeed, 
even in a small community of 30, to ‘label’ each member (susceptible; ill; dead;
recovered) and to keep a record of all 30 labels over all time. All that will happen
for  the  epidemic  models  examined  in  this  course  is  that  a  record  is  kept  of  the 
numbers  in  each  category.  In  closed  communities  such  as  those  we  shall  be 
looking  at,  the  total  number  in  all  the  different  categories  added  together  remains 
the  same  at  any  time. 
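
The bookkeeping just described can be sketched in a few lines of Python. This is only an illustration, not part of the course material: the `Counts` record and the variable names are hypothetical, the figures are those of Example 1.1(ii), and the Monday row is inferred from the statement that one infectious child arrived at school that day.

```python
from typing import NamedTuple


class Counts(NamedTuple):
    """Numbers in each category; individuals are not labelled."""
    susceptible: int
    infective: int
    removed: int


# Counts for the school of Example 1.1(ii).
# The Monday row is an inference (one infectious child, nobody yet removed).
school = {
    "Monday": Counts(29, 1, 0),
    "Tuesday": Counts(23, 7, 0),
    "Wednesday": Counts(9, 18, 3),
    "Friday": Counts(0, 13, 17),
    "next Monday": Counts(0, 0, 30),
}


def total(c: Counts) -> int:
    """Size of the closed community implied by one record."""
    return c.susceptible + c.infective + c.removed


totals = {day: total(c) for day, c in school.items()}
for day, size in totals.items():
    print(f"{day}: {size}")
```

In a closed community every entry of `totals` should be the same number, here 30.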

A  further  assumption  deals  with  the  manner  in  which  a  disease  spreads  from 
infectious  individuals  to  susceptibles.  At  any  given  moment,  members  of  the 
community  who  are  still  alive  and  have  not  been  isolated  are  free  to  come  into 
contact  with  any  others  who  are  still  alive  and  have  not  been  isolated.  It  is 
assumed  that  any  two  such  free  individuals  are  as  likely  to  come  into  contact  as 
any  other  two.  This  is  known  as  homogeneous  mixing  (or  ‘the  principle  of  mass 
action’).  Now,  some  of  these  contacts  will  be  between  two  susceptibles:  neither  can 
transmit  the  disease,  so  neither  catches  it.  Others  will  be  between  two  infectives: 
they  both  already  have  the  disease,  so  there  are  no  new  cases.  It  is  only  when  a 
susceptible  comes  into  contact  with  an  infective  that  the  disease  is  transmitted:  the 
epidemic  develops. 

The  following  question  is  about  the  rate  at  which  a  disease  is  spread. 

Question  1.1  Suppose  that  a  village  community  consists  of  n  +  1  members.  They 
are  a  reasonably  communicative  group:  each  member  comes  into  contact  with 
others according to a Poisson process at rate β. (This rate might be 5 per day in a
small  village.) 

(i)  Now  suppose  that  you  are  one  of  the  community:  there  are  n  others.  You 
have contacts according to a Poisson process at rate β. Assuming
homogeneous  mixing,  how  often  do  you  come  into  contact  with  the  particular 
villager  known  as  Gary?  What  is  the  contact  process? 

(ii)  Actually,  you  are  one  of  y  ill  people:  the  other  n  +  1  -  y  are  all  well.  How 
often  do  you  come  into  contact  with  a  well  person  (any  one,  it  does  not 
matter  which)? 

(iii)  Now  include  the  other  ill  people.  What  is  the  rate  at  which  ill  people  come 
into  contact  with  well  people?  (Of  course,  this  is  just  the  same  as  the  rate  at 
which  well  people  come  into  contact  with  ill  people.)  □ 

This question leads to the following general result.

In a community where there are y individuals of one type, and n + 1 − y of a
second type, if any particular member of the community comes into contact
with others according to a Poisson process at rate β, then contacts between
the different types will occur as a Poisson process at rate

\[ \beta_y = \frac{\beta y (n + 1 - y)}{n}. \]

The notation β_y is used because in many respects the epidemic model is a kind of
birth process: the number of new infections can only increase. The only unusual
feature for the epidemic model is that the ‘birth’ rate is neither constant (as in the
Poisson arrival process) nor proportional to y (as in the simple birth process); it
is quadratic in y. It is called the epidemic rate.

The epidemic rate β_y is small if y is small: there are not many infectives around to
spread the disease. The rate is also small if y is close to n + 1; that is, if nearly
everybody has caught the disease, and there are rather few left who have still to
catch it. At the two extremes it is zero. The epidemic rate is greatest at
intermediate values of y: there are many infected people around to spread the
disease, and quite a number still available to catch it.

This is just one of a number of
different possible representations of
the epidemic rate β_y. One other
quite common model assumes a
kind of ‘crowd effect’: the larger the
community, then the more often
individuals come into contact with
one another. An example is a
swimming pool into which more and
more people have jumped, though
not sufficiently many to preclude all
movement. Such a model would
have β_y = βy(n + 1 − y). One has to
concentrate on one model or
another; none is the ‘right’ model.
Some are just more appropriate to
some situations than others.

All of the above assumptions may be questioned in the context of a given
epidemic. Most obviously open to question is the assumption of homogeneous
mixing, which is rarely even approximately valid. Another doubtful one is the idea
that contacts occur as a Poisson process at constant rate. However, simplifying
assumptions have to be made in order to produce models that can be analysed
mathematically. As you will see, even such simple models can provide valuable
insights into the behaviour of epidemics. In addition, although this is not a point
which will be developed in this unit, various data sets collected from past epidemic
outbreaks have been found to be reasonably consistent with the predictions of the
simple models that we shall investigate.
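
As a rough numerical illustration of the epidemic rate β_y = βy(n + 1 − y)/n derived in this section, the following Python sketch tabulates the rate for each possible number y of infectives. The function name `epidemic_rate` and the values n = 8, β = 1 are chosen purely for illustration.

```python
def epidemic_rate(y: int, n: int, beta: float) -> float:
    """Rate of contacts between y infectives and n + 1 - y susceptibles,
    assuming homogeneous mixing and individual contact rate beta."""
    if not 0 <= y <= n + 1:
        raise ValueError("y must lie between 0 and n + 1")
    return beta * y * (n + 1 - y) / n


# Illustrative values only: a community of n + 1 = 9 with beta = 1.
n, beta = 8, 1.0
rates = [epidemic_rate(y, n, beta) for y in range(n + 2)]

for y, rate in enumerate(rates):
    print(f"y = {y}: rate = {rate:.3f}")
```

The printed table should show the behaviour described above: the rate vanishes at y = 0 and y = n + 1 and is largest for intermediate values of y.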


2  The  simple  epidemic 


The  first  model  that  we  shall  study  is  particularly  simple  in  that 

(i)  removals  are  assumed  not  to  occur: 

once  the  disease  is  caught,  individuals  do  not  recover  from  it  or,  for  that  matter, 
die  of  it.  They  show  no  symptoms  and  are  not  taken  into  isolation.  For  ever  after 
they  suffer  from  the  disease  and  are  capable  of  transmitting  it  to  others  who  have 
not  caught  it.  The  situation  is  even  more  straightforward  than  that  described  in 
Figure  1.2.  One  application  of  this  simple  model  could  be  to  a  disease  which 
spreads  so  rapidly  that  the  whole  population  is  infected  before  any  removals  have 
a  chance  of  occurring. 

For  the  simple  model  it  is  also  assumed  that 

(ii)  when  a  susceptible  individual  meets  an  infective,  then  he  catches  the  disease 
with  probability  one 

and 

(iii)  the  latent  period  is  of  length  zero. 

These  three  assumptions  of  the  simple  epidemic  mean  that  each  susceptible  on 
contact  with  an  infective  immediately  becomes  and  then  remains  an  infective. 

There  are  only  two  stages  to  the  disease,  which  are 

SUSCEPTIBLE → INFECTIVE,

and so there are only two categories to count. Suppose that the random variables
X(t) and Y(t) denote respectively the numbers of susceptibles and infectives in the
community at time t, and let X(0) = x0 and Y(0) = y0 denote their initial values at
time t = 0. Now, the community is closed, of size n + 1, say, and so

\[ x_0 + y_0 = X(t) + Y(t) = n + 1, \]

and this holds for all t.

It follows, therefore, that for all t, X(t) = n + 1 − Y(t). So it is sufficient to work
with the single random variable Y(t), the number of infectives, in order to monitor
the progress of the epidemic. If required, X(t) can always be found by subtraction.

Example  2.1  (Example  1.1  (i)  revisited) 

In a family of six (two parents and four children), one of the children catches a
‘bug’.  Next  day,  both  parents  and  one  of  the  other  children  are  also  down  with  the 
same  ailment.  Two  days  later,  they  all  have  it. 

In this case y0 = 1, n = 5 (and it follows that x0 = 5). Taking the unit of time to be
days, then the next information we are given is that Y(1) = 4: it follows that
X(1) = 2. Finally, Y(3) = 6 and X(3) = 0. Everyone in the family has caught the
disease. There is really no point in keeping track of the random process
{X(t); t ≥ 0} as well as {Y(t); t ≥ 0}, for X(t) = 6 − Y(t) for all t. □

We are concerned, therefore (not for the first time), with an integer-valued
continuous-time random process {Y(t); t ≥ 0}, in this case describing the number
of infected individuals in a closed community at time t.


Another  application  of  this  model  is 
to  the  spread  of  a  rumour:  once 
individuals  have  heard  an  item  of 
information,  then  for  ever  after  they 
remember  it  and  are  capable  of 
passing  it  on  to  others  who  have  not 
heard  it.  This  situation  is  described 
in  M245  Unit  8,  Section  5. 


It  might  seem  to  be  deliberately 
obscure  to  postulate  a  community  of 
size  n  +  1  rather  than  n.  Actually,  it 
eases  things  just  a  little — otherwise 
the cumbersome number n − 1
occurs  rather  a  lot. 

One could just as well work with
X(t), the number of susceptibles. But
Y(t) is increasing (X(t) is decreasing)
and it seems slightly more
straightforward to count the ill
people.




2.1  Setting  up  the  model 

Because the process {Y(t); t ≥ 0} described above is a sort of ‘birth’ process (the
numbers of infectives are increasing), the transition probability
P(Y(t + δt) = y + 1 | Y(t) = y) is given by

\[ P(Y(t + \delta t) = y + 1 \mid Y(t) = y) = \beta_y \, \delta t + o(\delta t), \]

where the parameter β_y is the epidemic rate. We must determine β_y. We shall
assume that any particular member of the community comes into contact with
other members according to a Poisson process at rate β (which could be
something like 12 per hour in a school).

Question  2.1  If  a  particular  member  of  the  community  comes  into  contact  with 
others at rate β, then at what rate are contacts between infectives and susceptibles
taking  place?  □ 

It  has  been  assumed  that  when  a  susceptible  individual  meets  an  infective,  then  he 
catches  the  disease  with  probability  one.  In  other  words,  the  number  of  infectives 
in  the  community  increases  immediately  by  one,  the  latent  period  being  of  length 
zero.  This  means  that  the  fundamental  probability  postulate  for  the  simple 
epidemic  model  is 


\[ P(Y(t + \delta t) = y + 1 \mid Y(t) = y) = \frac{\beta y (n + 1 - y)}{n} \, \delta t + o(\delta t). \tag{2.1} \]


Notice that when y = 0, the epidemic rate is β_y = 0, for there are no diseased
individuals from whom to catch the disease. Similarly, when y = n + 1, then again
β_y = 0, for there are no susceptible individuals left to whom it can be transmitted.
For intermediate values of y, β_y > 0.


Question 2.2  How would you alter the simple epidemic model to represent the
notion that when an infective meets a susceptible, the disease is passed on only
with probability θ, 0 < θ < 1? Would this refinement be complicated to
introduce?  □

Before  developing  the  stochastic  model  for  the  simple  epidemic,  we  shall  look  at 
the  deterministic  analogue.  This  we  have  done  before  in  the  context  of  different 
processes  such  as  the  simple  birth  process — the  deterministic  model  will  provide 
an  indicator  of  the  way  an  epidemic  is  ‘likely’  to  develop. 


The probability θ could be used to
reflect  the  virulence  of  the  disease; 
or  the  resistance  to  infection  of  those 
who  have  not  yet  had  it. 


2.2  The  deterministic  model 

From  Equation  (2.1)  it  follows  that  the  deterministic  model  for  the  simple  epidemic 
may  be  written  in  the  form  of  the  ordinary  differential  equation 

\[ \frac{dy}{dt} = \frac{\beta y (n + 1 - y)}{n}, \tag{2.2} \]

which  may  be  easily  solved  after  separation  of  the  variables. 

This equation could have been written in the form dz/dt = βz(n + 1 − z)/n, the
change  of  notation  intended  to  emphasize  that  it  is  the  deterministic  version  that  is 
being  studied.  This  is  what  has  been  done  in  previous  units.  However,  in  this  unit 
we  shall  need  simultaneously  to  consider  deterministic  analogues  involving  more 
than  one  random  variable.  No  confusion  should  arise  from  using  the 
corresponding  lower-case  letter. 




Question  2.3 

(i)  Show that the differential equation (2.2) has solution y, where

\[ \frac{n}{n + 1} \log\left(\frac{y}{n + 1 - y}\right) = \beta t + \text{constant}. \]

(ii)  Find y in terms of t, given the initial condition y = y0 = 1.  □


When  there  are  initially  one  infected  individual  (y0  =  1)  and  n  susceptibles  in  the 
community,  then  the  solution  of  the  differential  equation  (2.2)  is 


\[ y = \frac{n + 1}{n \exp\left(-\left(1 + \frac{1}{n}\right)\beta t\right) + 1}, \tag{2.3} \]

whose graph for n = 8 and β = 1 is shown in Figure 2.1.


You can see from Equation (2.3) that y → n + 1 as t → ∞. That is, in the limit all
the  original  susceptibles  become  infected. 
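
One way to gain some confidence in Equation (2.3) is to compare it with a direct numerical integration of the differential equation (2.2). The Python sketch below does this using Euler's method; the function names, the number of steps and the values n = 8, β = 1, t = 3 are illustrative assumptions rather than anything prescribed by the unit.

```python
import math


def y_exact(t: float, n: int, beta: float) -> float:
    """Equation (2.3): number of infectives at time t when y0 = 1."""
    return (n + 1) / (n * math.exp(-(1 + 1 / n) * beta * t) + 1)


def y_euler(t_end: float, n: int, beta: float, steps: int = 100_000) -> float:
    """Integrate dy/dt = beta*y*(n + 1 - y)/n from y(0) = 1 by Euler's method."""
    dt = t_end / steps
    y = 1.0
    for _ in range(steps):
        y += beta * y * (n + 1 - y) / n * dt
    return y


n, beta = 8, 1.0
exact = y_exact(3.0, n, beta)
approx = y_euler(3.0, n, beta)
print(f"closed form: {exact:.4f}, Euler: {approx:.4f}")
```

The two values should agree closely, and evaluating `y_exact` at a large value of t gives a number just below n + 1, in line with the limit noted above.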


If  an  epidemic  outbreak  is  being  monitored,  then  what  are  generally  recorded  over 
time  are  the  reports  of  new  cases.  In  our  model,  the  corresponding  quantity  is  the 
rate  at  which  new  infections  occur— that  is,  the  derivative  dy/dt.  The  graph  of 
dy/dt  against  t  is  known  as  the  epidemic  curve. 


Figure  2.1  Development  of  the 
simple  epidemic — deterministic 
model 


Question 2.4  Find dy/dt from Equation (2.3) (corresponding to the case
y0 = 1).  □


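The epidemic curve (with y0 = 1),

\[ \frac{dy}{dt} = \frac{\beta (n + 1)^2 \exp\left(-\left(1 + \frac{1}{n}\right)\beta t\right)}{\left[n \exp\left(-\left(1 + \frac{1}{n}\right)\beta t\right) + 1\right]^2}, \]

is shown in Figure 2.2, again for n = 8, β = 1.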


It follows from Equation (2.2) that at time 0 the height of the epidemic curve is
equal to βy0(n + 1 − y0)/n. Its maximum occurs when y = ½(n + 1) (calculated
using Equation (2.2)); in the special case y0 = 1, this maximum occurs at time

\[ \frac{n \log n}{\beta (n + 1)}. \tag{2.5} \]
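
For readers who would like a numerical check of the peak time (2.5), the following Python sketch evaluates the epidemic curve for y0 = 1 on a fine grid and locates its maximum. The function names, the grid spacing and the values n = 8, β = 1 are illustrative assumptions.

```python
import math


def epidemic_curve(t: float, n: int, beta: float) -> float:
    """dy/dt for the deterministic simple epidemic with y0 = 1."""
    e = math.exp(-(1 + 1 / n) * beta * t)
    return beta * (n + 1) ** 2 * e / (n * e + 1) ** 2


def peak_time(n: int, beta: float) -> float:
    """Time of the maximum of the epidemic curve, Equation (2.5)."""
    return n * math.log(n) / (beta * (n + 1))


n, beta = 8, 1.0
t_star = peak_time(n, beta)

# Brute-force search over a grid from t = 0 to t = 10 in steps of 0.001.
grid = [i * 0.001 for i in range(10_001)]
t_grid_max = max(grid, key=lambda t: epidemic_curve(t, n, beta))

print(f"formula: {t_star:.3f}, grid search: {t_grid_max:.3f}")
print(f"height at peak: {epidemic_curve(t_star, n, beta):.5f}")
```

The height at the peak can also be read off from Equation (2.2) with y = ½(n + 1), which gives β(n + 1)²/(4n).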


The  graphs  in  Figures  2.1  and  2.2 
retain  the  same  overall  shape  for 
other values of n and β.


Do  not  concern  yourself  with  the 
verification  of  these  facts. 


Question  2.5  At  time  zero,  an  inhabitant  of  a  village  of  total  size  112  becomes 
infected  with  a  disease. 


(i)  Assuming  that  any  particular  inhabitant  comes  into  contact  with  others  at  an 
average  rate  of  4  per  day,  how  many  villagers  are  infected  one  day  later?  How 
many  after  two  days? 

(ii)  When  is  the  disease  spreading  fastest? 

(iii)  How  long  is  it  before  the  whole  village  is  infected?  □ 


Now,  it  is  clear  for  the  simple  epidemic  as  described  that  eventually  the  whole 
community  will  become  infected.  That  this  happens  within  finite  time  is  shown  in 
Subsection  2.3.  As  you  saw  from  Figure  2.1  and  from  the  solution  to  Question 
2.5(iii),  the  deterministic  model  says  that  the  whole  community  will  become  infected 
only  after  infinite  time.  As  in  previous  applications,  the  deterministic  approach  is 
of  limited  use.  We  shall  now  proceed  to  the  stochastic  case. 




2.3  The  stochastic  model 


We now turn to the stochastic version of the simple epidemic model, which is an
integer-valued continuous-time random process $\{Y(t);\ t \ge 0\}$, where the random
variable $Y(t)$ is the number of infectives at time $t$. As in the deterministic case, we
shall assume that the community through which the disease is spreading is closed,
of size $n + 1$, and that initially the number of infectives is $Y(0) = y_0$. We shall
attempt to find the probability distribution

$$p_y(t) = P(Y(t) = y)$$

(strictly, the conditional probability distribution $P(Y(t) = y \mid Y(0) = y_0)$). We
already have Result (2.1):

$$P(Y(t + \delta t) = y + 1 \mid Y(t) = y) = \frac{\beta y(n + 1 - y)}{n}\,\delta t + o(\delta t).$$

As the epidemic progresses, the process $\{Y(t);\ t \ge 0\}$ passes as a birth process from
its initial state $y_0$ through the states $y_0 + 1, y_0 + 2, \ldots, n - 1, n$, until the state
$n + 1$ is reached, at which point all the susceptibles have been infected and the
epidemic has terminated.


Question 2.6 Modelling the simple epidemic as one of the family of general birth
and death models, show that the differential-difference equations for the
probability distribution of $Y(t)$ for the simple epidemic are

$$\frac{d}{dt}p_y(t) = -\frac{\beta y(n + 1 - y)}{n}\,p_y(t) + \frac{\beta(y - 1)(n + 2 - y)}{n}\,p_{y-1}(t), \quad y = 1, 2, \ldots, n + 1. \tag{2.6}$$

□


In principle the probability distribution of $Y(t)$ can be deduced from
iterative solution of the differential-difference equations (2.6). The first one to
solve ($y = 1$) is

$$\frac{d}{dt}p_1(t) = -\beta p_1(t);$$

the next is

$$\frac{d}{dt}p_2(t) = -2\beta\left(\frac{n - 1}{n}\right)p_2(t) + \beta p_1(t);$$

and so on. The last ($y = n + 1$) is

$$\frac{d}{dt}p_{n+1}(t) = \beta p_n(t).$$


In the past, however, we have used the set of differential equations to obtain a
partial differential equation for the probability generating function

$$\Pi(s, t) = \sum_{y=0}^{n+1} p_y(t)s^y, \tag{2.7}$$

which has often proved to be a much more elegant and straightforward way of
obtaining the probability distribution $\{p_y(t)\}$ of the random variable $Y(t)$. In this
case it can be shown that the partial differential equation for the probability
generating function $\Pi(s, t)$ for the simple epidemic is of the form

$$\frac{\partial \Pi}{\partial t} = \beta s(s - 1)\left(\frac{\partial \Pi}{\partial s} - \frac{s}{n}\frac{\partial^2 \Pi}{\partial s^2}\right). \tag{2.8}$$

Now, the partial differential equation (2.8) for the probability generating function
$\Pi(s, t)$ is not of the Lagrange form which in Unit 7 you learned how to solve, and
which you have come across several times since. The presence of the term $\partial^2 \Pi/\partial s^2$,
a second-order partial derivative, is what makes the difference, and this follows
from the fact that the parameter $\beta_y$ is quadratic in $y$. So it looks as though the
only way to derive the probability distribution of $Y(t)$ in a given situation is, after
all, through iterative solution of the differential equations (2.6) for $\{p_y(t)\}$. This
approach is illustrated in the following example.


. . .  unless  you  have  studied  the 
solution  of  such  partial  differential 
equations  as  (2.8)  in,  say,  MST322 
Unit  9. 




Example  2.2 

In a household of size 5, initially 3 have an infectious disease. Assuming that $\beta$ is
equal to 2, find the probability distribution of the number of infected persons at
time $t$.

Solution

With three individuals infected initially, it follows that $p_1(t) = p_2(t) = 0$ for all $t$.
The differential equations necessary to find the probability distribution of $Y(t)$ are

$$\frac{d}{dt}p_y(t) = -\beta_y p_y(t) + \beta_{y-1}p_{y-1}(t), \quad y = 3, 4, 5,$$

where, since $\beta = 2$ and $n + 1 = 5$,

$$\beta_y = \frac{\beta y(n + 1 - y)}{n} = \frac{2y(5 - y)}{4} = \tfrac{1}{2}y(5 - y).$$

So $\beta_3 = 3$, $\beta_4 = 2$ and $\beta_5 = 0$, and we have the three differential equations

$$\frac{d}{dt}p_3(t) = -3p_3(t), \tag{2.9a}$$

$$\frac{d}{dt}p_4(t) = -2p_4(t) + 3p_3(t) \tag{2.9b}$$

and

$$\frac{d}{dt}p_5(t) = 2p_4(t). \tag{2.9c}$$

Solution of the first differential equation (2.9a) gives

$$p_3(t) = \text{constant} \times e^{-3t},$$

and using the initial condition $p_3(0) = 1$ (three infected people to begin with), it
follows that the constant is 1. So

$$p_3(t) = e^{-3t}.$$

Next, setting this result into the second equation (2.9b) and rearranging the terms,
we have

$$\frac{d}{dt}p_4(t) + 2p_4(t) = 3e^{-3t}. \tag{2.10}$$

Now this equation is of the form $dp/dt + A(t)p = B(t)$. Solution is eased by
multiplying both sides by the integrating factor $\exp\left(\int A(t)\,dt\right)$. In this case $A(t) = 2$,
and so $\int A(t)\,dt = 2t$. Therefore we multiply both sides of Equation (2.10) by $e^{2t}$:

$$\left(\frac{d}{dt}p_4(t)\right)e^{2t} + 2p_4(t)e^{2t} = 3e^{-t},$$

which is the same as

$$\frac{d}{dt}\left(p_4(t)e^{2t}\right) = 3e^{-t}.$$

Integrating this gives

$$p_4(t)e^{2t} = \text{constant} - 3e^{-t}.$$

From the initial condition $p_4(0) = 0$ (three infected people to begin with), it follows
that the constant is 3, so

$$p_4(t) = 3e^{-2t} - 3e^{-3t}.$$

Finally, setting this result into the last equation (2.9c), we have

$$\frac{d}{dt}p_5(t) = 6e^{-2t} - 6e^{-3t},$$

whose solution is

$$p_5(t) = -3e^{-2t} + 2e^{-3t} + \text{constant}.$$

Since $p_5(0) = 0$, it follows that the constant is 1. This gives

$$p_5(t) = 1 - 3e^{-2t} + 2e^{-3t}.$$




The probability distribution of $Y(t)$ may be summarized as

$$p_3(t) = e^{-3t},$$

$$p_4(t) = 3e^{-2t} - 3e^{-3t},$$

$$p_5(t) = 1 - 3e^{-2t} + 2e^{-3t}.$$

Of course, $p_3(t) + p_4(t) + p_5(t) = 1$. This is a useful check on one's algebra. □
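The algebra of Example 2.2 can also be checked numerically. The sketch below integrates the differential-difference equations (2.6) with the classical fourth-order Runge-Kutta method (a numerical technique swapped in here for illustration, not the method used in the text) and compares the result with the closed-form answer. The function names and the step count are assumptions made for this illustration.

```python
import math

def birth_rate(y, n, beta):
    """beta_y = beta * y * (n + 1 - y) / n, the infection rate in state y."""
    return beta * y * (n + 1 - y) / n

def derivatives(p, y0, n, beta):
    """Right-hand sides of Equations (2.6) for the states y0, ..., n + 1."""
    out = []
    for k, y in enumerate(range(y0, n + 2)):
        inflow = birth_rate(y - 1, n, beta) * p[k - 1] if k > 0 else 0.0
        out.append(-birth_rate(y, n, beta) * p[k] + inflow)
    return out

def integrate(t_end, y0, n, beta, steps=2000):
    """Runge-Kutta integration starting from p_{y0}(0) = 1."""
    p = [1.0] + [0.0] * (n + 1 - y0)
    h = t_end / steps
    for _ in range(steps):
        k1 = derivatives(p, y0, n, beta)
        k2 = derivatives([a + 0.5 * h * b for a, b in zip(p, k1)], y0, n, beta)
        k3 = derivatives([a + 0.5 * h * b for a, b in zip(p, k2)], y0, n, beta)
        k4 = derivatives([a + h * b for a, b in zip(p, k3)], y0, n, beta)
        p = [a + h * (b1 + 2 * b2 + 2 * b3 + b4) / 6
             for a, b1, b2, b3, b4 in zip(p, k1, k2, k3, k4)]
    return p

def closed_form(t):
    """The distribution found in Example 2.2 (y0 = 3, n + 1 = 5, beta = 2)."""
    return [math.exp(-3 * t),
            3 * math.exp(-2 * t) - 3 * math.exp(-3 * t),
            1 - 3 * math.exp(-2 * t) + 2 * math.exp(-3 * t)]

if __name__ == "__main__":
    print("numerical:  ", integrate(1.0, y0=3, n=4, beta=2.0))
    print("closed form:", closed_form(1.0))
```

The same `integrate` function can be reused for other household sizes by changing `y0`, `n` and `beta`; only `closed_form` is specific to Example 2.2.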

Incidentally, from the probability distribution of $Y(t)$ it is fairly straightforward to
write down the corresponding probability generating function $\Pi(s, t)$.

Question 2.7 In this question you are asked to find the way a disease spreads
through a household of size four, where initially one individual is infected. As in
Example 2.2, this is done by calculating the probability distribution of the random
variable $Y(t)$, the total number of members of the household infected at time $t$. The
value of $\beta$ is taken to be 3.

(i) Determine the sequence of differential equations for

$$p_y(t) = P(Y(t) = y), \quad y = 1, 2, 3, 4,$$

and write down the initial values $p_1(0)$, $p_2(0)$, $p_3(0)$, $p_4(0)$.

(ii) Solve the differential equations with initial conditions found in part (i).

(iii) What happens to $p_1(t)$, $p_2(t)$, $p_3(t)$ and $p_4(t)$ as $t \to \infty$? Interpret your
answer. □

2.4  The  duration  of  the  simple  epidemic 

If we define the duration $W$ of the epidemic to be the waiting time until the last
remaining susceptible is infected, then we can write its probability distribution in
terms of the probability distribution of $Y(t)$, the number of infectives, using the
identity

$$P(W \le t) = P(Y(t) = n + 1), \tag{2.11}$$

for if everybody has contracted the disease by time $t$ ($Y(t) = n + 1$), then the last
person must have caught it at or before time $t$.

If the probability distribution of $Y(t)$ were known, then this would be a
straightforward exercise. If not, then it would be a long process to find the
probabilities $P(Y(t) = y_0)$, $P(Y(t) = y_0 + 1)$, $\ldots$, $P(Y(t) = n + 1)$ in order to obtain
the distribution of $W$.

An alternative approach is as follows. Suppose that at time 0 there are $y_0$
infectives. Then the waiting time $T_1$ until the first infection of a susceptible is
exponential with parameter $\beta_{y_0}$: $T_1 \sim M(\beta_{y_0})$, where $\beta_j = \beta j(n + 1 - j)/n$ is the
infection rate when there are $j$ infectives, as in Example 2.2. Similarly, and independently of this,
the waiting time between the first and second infection is $T_2 \sim M(\beta_{y_0+1})$. The
epidemic continues: it is complete with the infection of the $(n + 1)$th member of the
community, that is, at the $(n + 1 - y_0)$th infection: $T_{n+1-y_0} \sim M(\beta_n)$. The duration
$W$ of the epidemic is equal to the sum of all the waiting times:

$$W = \sum_{j=1}^{n+1-y_0} T_j.$$

The distribution of $W$ is not simple, consisting as it does of the sum of exponential
variates with different parameters. However, its mean and variance are easily
found:

$$E(W) = \sum_{j=y_0}^{n} \frac{1}{\beta_j}, \qquad V(W) = \sum_{j=y_0}^{n} \frac{1}{\beta_j^2}. \tag{2.12}$$
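The representation of $W$ as a sum of independent exponential waiting times also suggests a direct way to simulate the duration. The following is a rough sketch under that representation; the function names, the seed and the sample size are arbitrary choices made for illustration.

```python
import random

def birth_rate(y, n, beta):
    """beta_y = beta * y * (n + 1 - y) / n, the infection rate in state y."""
    return beta * y * (n + 1 - y) / n

def simulate_duration(y0, n, beta, rng):
    """One realisation of W: add up the waiting times spent in states y0, ..., n."""
    return sum(rng.expovariate(birth_rate(y, n, beta)) for y in range(y0, n + 1))

def sample_moments(y0, n, beta, runs, rng):
    """Sample mean and sample variance of W over the given number of runs."""
    samples = [simulate_duration(y0, n, beta, rng) for _ in range(runs)]
    mean = sum(samples) / runs
    variance = sum((w - mean) ** 2 for w in samples) / (runs - 1)
    return mean, variance

if __name__ == "__main__":
    rng = random.Random(343)  # arbitrary seed, used only for reproducibility
    # The household of Example 2.2: y0 = 3, n + 1 = 5, beta = 2.
    print(sample_moments(3, 4, 2.0, runs=20000, rng=rng))
```

With a large number of runs, the sample mean and variance should settle close to the sums given by Result (2.12).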




Example  2.3 

In Example 2.2 (for which $y_0 = 3$, $n + 1 = 5$, $\beta = 2$) we found that

$$P(Y(t) = 5) = 1 - 3e^{-2t} + 2e^{-3t},$$

and so for this example, using Equation (2.11), we have

$$P(W \le t) = 1 - 3e^{-2t} + 2e^{-3t}. \tag{2.13}$$

We shall find $E(W)$ by using the definition and then check it by using Result (2.12).
Since $W > 0$,

$$E(W) = \int_0^\infty P(W > t)\,dt;$$

Unit 1, Equation

so

$$E(W) = \int_0^\infty \left(3e^{-2t} - 2e^{-3t}\right)dt, \quad \text{using Equation (2.13)},$$

$$= \tfrac{3}{2} - \tfrac{2}{3} = \tfrac{5}{6}.$$

This result is easily checked, as follows. We had found $\beta_3 = 3$, $\beta_4 = 2$ and from
Result (2.12),

$$E(W) = \sum_{j=y_0}^{n} \frac{1}{\beta_j} = \sum_{j=3}^{4} \frac{1}{\beta_j} = \tfrac{1}{3} + \tfrac{1}{2} = \tfrac{5}{6}.$$

The variance of $W$ is $V(W) = 1/3^2 + 1/2^2 = 1/9 + 1/4 = 13/36$; this is much easier
than using Result (2.13). □
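The check in Example 2.3 is mechanical enough to automate. The sketch below evaluates Result (2.12) in exact rational arithmetic; `duration_moments` is a hypothetical helper name, and the parameter `beta` is assumed to be an integer or a `Fraction` so that no rounding occurs.

```python
from fractions import Fraction

def duration_moments(y0, n, beta):
    """Mean and variance of W from Result (2.12), as exact fractions."""
    mean = Fraction(0)
    variance = Fraction(0)
    for j in range(y0, n + 1):
        rate = Fraction(beta * j * (n + 1 - j), n)  # beta_j
        mean += 1 / rate
        variance += 1 / rate ** 2
    return mean, variance

if __name__ == "__main__":
    # Example 2.3: y0 = 3, n + 1 = 5, beta = 2.
    print(duration_moments(3, 4, 2))
```

Using `Fraction` keeps the output in the same form as the hand calculation, which makes comparison with worked answers straightforward.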


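It is easy to write down the value of the integral $\int_0^\infty \alpha e^{-\beta t}\,dt$ immediately.
It is $(\alpha/\beta)\int_0^\infty \beta e^{-\beta t}\,dt$, which is just $\alpha/\beta$, since $\beta e^{-\beta t}$ is the probability
density function of $M(\beta)$, which has range $x > 0$.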


Question 2.8 Find the cumulative distribution function of the duration of a
simple epidemic where $y_0 = 1$, $n = 3$ and $\beta = 3$ (see Solution 2.7). Use this to find
the expected duration, and check your answer using Result (2.12). What is the
variance of $W$? □


Question 2.9 In the case of a simple epidemic with $y_0 = 1$, show that the
expected duration is

$$E(W) = \frac{2n}{(n + 1)\beta}\sum_{j=1}^{n} \frac{1}{j},$$

and evaluate this when $\beta = 1$ and (i) $n = 5$, (ii) $n = 10$, (iii) $n = 20$. □


If  you  studied  M245,  you  will  have 
used  this  approximation  before. 


What  these  numerical  results  illustrate  is  that  the  larger  a  community,  then  the 
longer  on  average  it  takes  a  disease  to  spread  through  it.  This  average  time  is 
roughly  proportional  to  the  logarithm  of  the  community  size  (so  it  takes  not  ten 
times  as  long  but  only  twice  as  long  for  a  disease  to  spread  through  a  community 
of  size  100  as  through  one  of  10). 

Other  models  can  give  quite  different  results:  for  instance,  it  might  be  appropriate 
in  some  cases  to  use  a  model  where  a  disease  spreads  through  a  large  community 
more  rapidly  than  through  a  small  one.  You  need  not  worry  about  the  details. 
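The logarithmic growth described above can be seen numerically. The sketch below uses the formula for $E(W)$ from Question 2.9 (so it assumes $y_0 = 1$); the function name `expected_duration` and the community sizes printed are illustrative choices.

```python
def expected_duration(n, beta):
    """E(W) for y0 = 1: 2n / ((n + 1) * beta) times (1 + 1/2 + ... + 1/n)."""
    harmonic = sum(1 / j for j in range(1, n + 1))
    return 2 * n * harmonic / ((n + 1) * beta)

if __name__ == "__main__":
    # Community size is n + 1, so a community of size 10 has n = 9.
    for size in (10, 100, 1000):
        print(size, expected_duration(size - 1, beta=1.0))
```

Comparing the values printed for community sizes 10 and 100 shows the second to be roughly double the first, in line with the remark in the text.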


Question 2.10 A useful result is

$$1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{n} \simeq \log(n + 1) - \frac{1}{2(n + 1)} + 0.5772.$$

Repeat Question 2.9(i)-(iii) using this approximation. What is the expected
duration of the simple epidemic (iv) when $n = 200$, (v) when $n = 2000$? □




3  The  general  epidemic 


The  model  that  we  shall  investigate  in  this  section  and  the  next  was  first  developed 
in  the  1920s  by  W.  O.  Kermack  and  A.  G.  McKendrick,  although  a  thorough 
investigation  of  the  stochastic  version  was  begun  by  M.  S.  Bartlett  only  in  the 
1950s.  The  term  general  epidemic  has  come  to  be  commonly  used  for  this  model, 
although  many  more  general  models  have  since  been  developed. 

The  assumptions  of  the  model  are,  as  before,  that  the  epidemic  progresses  through 
a  community  when  an  infective  meets  a  susceptible  and  the  disease  is  passed  on: 
the  number  of  infectives  increases  by  one,  and  the  number  of  susceptibles  is 
correspondingly  reduced  by  one.  For  the  simple  epidemic  model  of  the  previous 
section,  the  disease  progresses  in  this  way  through  the  whole  of  the  community 
until  everybody  has  caught  it  and  there  is  nobody  left.  However,  in  the  general 
model,  a  third  category  is  introduced,  to  cover  a  situation  where 

individuals  infected  with  the  disease  are,  after  some  random  time, 
prevented  from  communicating  it  further — through  isolation,  or  through 
recovery,  or  perhaps  because  they  have  died. 

These  individuals  are  referred  to  as  having  been  removed.  Whatever  the  reason  for 
removal,  it  follows  that  as  well  as  increasing  when  susceptibles  become  infected 
with  the  disease,  the  class  of  infectives  also  decreases  when  infectives  are  removed. 
Any  individual  who  is  infected  necessarily  passes  through  the  three  stages 

SUSCEPTIBLE  — ►  INFECTIVE  — >  REMOVED. 

What  sort  of  course  could  the  epidemic  follow?  The  disease  could,  as  in  the  simple 
case,  affect  everybody  in  the  community.  But  there  is  another  possibility,  and  this 
is  that  the  class  of  infectives  is  reduced  through  removal  to  zero,  before  all  the 
susceptibles  have  become  infected.  The  disease  has  run  its  course,  and  some 
members  of  the  community  remain  free  of  the  disease,  there  being  nobody  left  from 
whom  to  catch  it.  The  sorts  of  questions  one  might  then  be  interested  in  answering 
are:  What  is  the  probability  that  not  everybody  in  a  community  is  afflicted?  What 
is  the  average  number  of  persons  to  catch  the  disease?  How  long  does  the 
epidemic  last? 

In order to speak of the various categories involved in the general epidemic,
suppose as before that $X(t)$ denotes the number of susceptibles at time $t$ and that
$Y(t)$ denotes the number of infectives. Further, suppose that the random variable
$Z(t)$ denotes the number removed by time $t$. At time $t = 0$, the initial values are
$X(0) = x_0$, $Y(0) = y_0$, $Z(0) = z_0$. Then, assuming once again that the community is
closed and of size $n + 1$, the relationship

$$x_0 + y_0 + z_0 = X(t) + Y(t) + Z(t) = n + 1$$

holds for all $t$. (For the epidemic to stand a chance of starting at all, it will always
be assumed that $y_0 \ge 1$ and $x_0 \ge 1$. Also, in all the cases we consider, $z_0 = 0$.)


That  is,  assumptions  (ii)  and  (iii)  of 
the  simple  epidemic  apply  also  in 
the  case  of  the  general  epidemic. 


That  is,  assumption  (i)  of  the  simple 
epidemic  does  not  apply  here. 


Example  3.1  (i)  (Example  1.1  (ii)  revisited) 

A  small  school  comprises  30  young  children.  On  Sunday  (unknown  to  parents  and 
teachers)  one  of  them  catches  an  infectious  disease,  and  takes  it  to  school  on 
Monday.  On  Tuesday,  7  children  are  ill  with  the  disease,  and  are  infectious  (and 
the  other  23  are  well,  but  susceptible).  On  Wednesday,  3  have  recovered,  18  are  ill 
and  9  have  yet  to  catch  the  disease.  By  Friday  there  are  13  ill  children — all  the 
remaining  17  children  have  had  the  disease  but  have  recovered.  By  the  next 
Monday,  all  30  children  are  well  again. 

In  this  case,  if  we  take  the  unit  of  time  to  be  days,  the  disease  progresses  as 
follows. 


Monday:     X(1) = 29    Y(1) = 1     Z(1) = 0
Tuesday:    X(2) = 23    Y(2) = 7     Z(2) = 0
Wednesday:  X(3) = 9     Y(3) = 18    Z(3) = 3
Friday:     X(5) = 0     Y(5) = 13    Z(5) = 17
Monday:     X(8) = 0     Y(8) = 0     Z(8) = 30



Notice that $X(t) + Y(t) + Z(t) = 30$ for all $t$. ($Z(t)$ can be found by subtraction, if
the number of infectives and remaining susceptibles is known.) As it happens, in
this example everybody ends up catching the disease. This does not always
happen. □

Example 3.1(ii) (Example 1.1(iii) revisited)

Plague  breaks  out  in  a  small  peasant  hamlet,  where  there  are  a  hundred 
inhabitants.  A  month  later  all  traces  of  the  disease  have  disappeared,  but  it  has  left 
sixty-five  dead.  Eight  members  of  the  community  caught  the  disease  and  managed 
to  survive— it  is  assumed  that  they  would  be  immune  from  the  disease  were  it  to 
strike  again.  The  remaining  twenty-seven  were  free  from  infection  throughout  the 
attack. 

Taking the unit of time to be months (assuming $y_0 \ge 1$ and $z_0 = 0$), the end
result is

$$X(1) = 27, \quad Y(1) = 0, \quad Z(1) = 73.$$

In  this  case  the  number  of  removed  persons  is  found  by  adding  together  those  who 
died  from  the  disease  (65)  as  well  as  those  who  recovered  (8).  This  situation  is 
more  complicated  than  any  we  shall  consider:  the  ‘recovery’  rate  and  ‘death’  rate 
would  probably  be  different,  for  a  start.  But  more  particularly,  removal  through 
death  would  also  reduce  the  total  population  size,  and  consequently  affect  the 
epidemic  rate.  □ 

In this course it will usually be assumed that there is only one reason for
removal (recovery from the disease, after which the individual is immune from
further attacks) and that the rate at which recovery takes place is known, and is
the same for every infected individual. However, the recovered individual remains
in the population, maintaining contact with the others.

At  the  end  of  this  section  we  shall  briefly  look  at  a  model  where  ‘removal’  means 
precisely  that,  after  some  random  time,  a  person  infected  with  the  disease  dies 
from  it.  Or  possibly,  after  some  random  time,  an  infected  person  displays 
symptoms  of  the  disease  and  is  taken  into  isolation.  In  either  case,  the  slight 
adjustment  necessary  to  the  model  leads  to  some  increase  in  mathematical 
complexity,  and  we  shall  not  pursue  the  model  beyond  a  brief  mention. 


3.1  Setting  up  the  model 

Although there are now three categories under consideration, the susceptibles ($X$),
the infectives ($Y$) and the removed ($Z$), it follows from the fact that they all reside
in a closed community of known total size that only two need to be counted in the
analysis. We shall count only the susceptibles $X(t)$ and the infectives $Y(t)$; we can
always obtain $Z(t)$ by subtraction if we want to know how many individuals have
recovered from the disease by time $t$.

Nevertheless, we now have no longer a univariate random process to analyse, but
the bivariate random process

$$\{(X(t), Y(t));\ t \ge 0\}$$

with a state space which takes values not over the set of non-negative integers but
over the two-dimensional integer lattice

$$\{(x, y):\ x = 0, 1, 2, \ldots, n + 1;\ y = 0, 1, 2, \ldots, n + 1;\ x + y \le n + 1\}.$$

The two variates $X(t)$ and $Y(t)$ could be considered separately, but it would not
really be sensible to do so, for the way in which the disease spreads through the
community depends on both of them.

The random process $\{(X(t), Y(t))\}$ changes state whenever one of two things
happens. The first is that an infective meets a susceptible, and the disease is passed
on. The $X$-class goes down by one, and the $Y$-class goes up by one (actually the
same individual, but we need not bother about this as we are not concerned with
labelling individual members of the community). The $Z$-class does not alter:
nobody is removed.




If there are $x$ susceptibles and $y$ infectives at time $t$, then at that stage of the
epidemic the average rate at which infectives come into contact with susceptibles is

$$\frac{\beta xy}{n}.$$

The only other thing which might happen is that an infective is removed: he
recovers from the disease. Suppose generally that a susceptible who becomes
infected with the disease remains in a position to infect others only for some
random time, specifically for some exponentially distributed time $M(\gamma)$. Now, if
there are $y$ infectives at time $t$, then the average rate at which they are being
removed is

$$\gamma y.$$

(The recovery process is, in this respect, exactly like the pure death process of
Unit 7.) In this case the $Y$-class is reduced by one; the size of the $X$-class does not
alter.

These two transitions of state can be summarized in the form of two conditional
probability statements:

$$P(X(t + \delta t) = x - 1,\ Y(t + \delta t) = y + 1 \mid X(t) = x,\ Y(t) = y) = \frac{\beta xy}{n}\,\delta t + o(\delta t) \tag{3.1a}$$

and

$$P(X(t + \delta t) = x,\ Y(t + \delta t) = y - 1 \mid X(t) = x,\ Y(t) = y) = \gamma y\,\delta t + o(\delta t). \tag{3.1b}$$
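Statements (3.1a) and (3.1b) are enough to simulate one realisation of the process. The sketch below uses the standard next-event approach for continuous-time Markov processes (wait an exponential time with the total rate, then choose the transition in proportion to its rate); this technique is brought in here for illustration and is not developed in the text. The function name is hypothetical, and $z_0 = 0$, $x_0 \ge 1$, $y_0 \ge 1$ are assumed.

```python
import random

def simulate_general_epidemic(x0, y0, beta, gamma, rng):
    """Simulate (X(t), Y(t)) until no infectives remain; return the path."""
    n = x0 + y0 - 1  # the community has size n + 1, with z0 = 0 assumed
    t, x, y = 0.0, x0, y0
    path = [(t, x, y)]
    while y > 0:
        infection_rate = beta * x * y / n  # transition (3.1a)
        removal_rate = gamma * y           # transition (3.1b)
        total = infection_rate + removal_rate
        t += rng.expovariate(total)        # waiting time until the next event
        if rng.random() < infection_rate / total:
            x, y = x - 1, y + 1            # a susceptible becomes infective
        else:
            y -= 1                         # an infective is removed
        path.append((t, x, y))
    return path

if __name__ == "__main__":
    rng = random.Random(10)  # arbitrary seed, for reproducibility only
    path = simulate_general_epidemic(20, 1, beta=2.0, gamma=1.0, rng=rng)
    t_end, x_end, y_end = path[-1]
    print("epidemic ended at time", t_end, "with", x_end, "susceptibles left")
```

Each run ends when the number of infectives reaches zero, and different seeds give different numbers of surviving susceptibles.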

This is a more complicated problem in probability than any we have encountered
so far, because two variates are involved. The sorts of probabilities involved will be
from the bivariate distribution

$$P(X(t) = x,\ Y(t) = y)$$

(strictly, $P(X(t) = x,\ Y(t) = y \mid X(0) = x_0,\ Y(0) = y_0)$) and presumably there is
some sort of bivariate analogue of the univariate probability generating function
$\Pi(s, t)$. (There is, but we shall not deal with it.) We shall leave this stochastic model
to the next section, and for the remainder of this section deal with the
deterministic formulation of the general epidemic model, which is simpler to
analyse. We can use the probability statements (3.1a) and (3.1b) to set up not one
but the two ordinary differential equations which constitute the deterministic
model.


3.2  The  deterministic  model 


There are now two different variates to consider, $X(t)$ and $Y(t)$. It follows from
Equations (3.1a) and (3.1b) that in the case of susceptibles,

$$P(X(t + \delta t) = x - 1 \mid X(t) = x,\ Y(t) = y) = \frac{\beta xy}{n}\,\delta t + o(\delta t), \tag{3.2}$$

while in the case of infectives,

$$P(Y(t + \delta t) = y - 1 \mid X(t) = x,\ Y(t) = y) = \gamma y\,\delta t + o(\delta t) \tag{3.3a}$$

and

$$P(Y(t + \delta t) = y + 1 \mid X(t) = x,\ Y(t) = y) = \frac{\beta xy}{n}\,\delta t + o(\delta t). \tag{3.3b}$$

For the susceptibles, the corresponding deterministic formulation is
straightforward: from Equation (3.2), the differential equation requiring solution is

$$\frac{dx}{dt} = -\frac{\beta xy}{n} \quad (x > 0,\ y > 0). \tag{3.4a}$$

We shall continue to suppose that
the average rate at which each
specified individual meets others is
$\beta$, as for the simple model.




The infectives can have both increments and decrements. In the notation of Unit 7,
Equation (4.4),

$$h_1(y, t) = \frac{\beta xy}{n}, \qquad h_{-1}(y, t) = \gamma y;$$

so the corresponding differential equation for $y$ is

$$\frac{dy}{dt} = h_1(y, t) - h_{-1}(y, t) = \frac{\beta xy}{n} - \gamma y \quad (x > 0,\ y > 0). \tag{3.4b}$$

(Notice that were we interested in counting the removals, then we would have

$$\frac{dz}{dt} = \gamma y,$$

and so

$$\frac{d}{dt}(x + y + z) = \frac{dx}{dt} + \frac{dy}{dt} + \frac{dz}{dt} = -\frac{\beta xy}{n} + \left(\frac{\beta xy}{n} - \gamma y\right) + \gamma y = 0,$$

with solution $x + y + z = \text{constant}$; which of course is the case, for the total in all
the categories is the population size, $n + 1$.)


Equations  (3.4a)  and  (3.4b)  are  simultaneous  differential  equations  in  x  and  y  at 
time  t,  with  solutions  for  x  and  y  of  the  form 

x  =  some  function  of  t, 
y  =  some  other  function  of  t. 

In  fact,  it  is  not  possible  to  obtain  these  functions  explicitly  (not  all  differential 
equations  can  be  solved  in  closed  form).  But  it  is  possible  to  see  how  x  and  y  at 
time  t  are  related  to  one  another,  by  establishing  a  relationship  between  them 
using  Equations  (3.4a)  and  (3.4b)  and  eliminating  the  variable  t. 
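Although Equations (3.4a) and (3.4b) have no closed-form solution, they are easy to solve numerically. The following sketch uses the classical fourth-order Runge-Kutta method, which is a choice made here for illustration rather than anything prescribed by the text. The function name, the step count and the parameter values in the usage example are assumptions, and $z_0 = 0$ is taken throughout.

```python
def general_epidemic_ode(x0, y0, beta, gamma, t_end, steps=10000):
    """Integrate dx/dt = -beta*x*y/n, dy/dt = beta*x*y/n - gamma*y, dz/dt = gamma*y."""
    n = x0 + y0 - 1  # community of size n + 1 with z0 = 0

    def rhs(x, y):
        infection = beta * x * y / n
        removal = gamma * y
        return (-infection, infection - removal, removal)

    h = t_end / steps
    x, y, z = float(x0), float(y0), 0.0
    for _ in range(steps):
        k1 = rhs(x, y)
        k2 = rhs(x + 0.5 * h * k1[0], y + 0.5 * h * k1[1])
        k3 = rhs(x + 0.5 * h * k2[0], y + 0.5 * h * k2[1])
        k4 = rhs(x + h * k3[0], y + h * k3[1])
        x += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        y += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
        z += h * (k1[2] + 2 * k2[2] + 2 * k3[2] + k4[2]) / 6
    return x, y, z

if __name__ == "__main__":
    x, y, z = general_epidemic_ode(20, 1, beta=2.0, gamma=1.0, t_end=40.0)
    print("x =", x, " y =", y, " z =", z, " total =", x + y + z)
```

The printed total should remain equal to the population size $n + 1$, which is the conservation property noted above.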


This is achieved by dividing one equation by the other: it does not matter which
divides which, but in this case, dividing the second (3.4b) by the first (3.4a) gives

$$\frac{dy/dt}{dx/dt} = \frac{(\beta xy/n) - \gamma y}{-\beta xy/n} \quad (x > 0,\ y > 0). \tag{3.5}$$

It is possible to simplify both the left-hand side and the right-hand side of
Equation (3.5); the right-hand side simplifies in a quite straightforward manner.
The left-hand side simplifies to $dy/dx$ ('multiplying numerator and denominator by
$dt$'). This you might not have seen before, but you may be assured that it is a valid
simplification in this case. Cancelling the $y$s and multiplying the numerator and
denominator of the right-hand side by $n/\beta$ gives $\dfrac{x - n\gamma/\beta}{-x}$. Hence, Equation (3.5)
becomes

$$\frac{dy}{dx} = \frac{\rho - x}{x} \quad (x > 0), \tag{3.6}$$

where the ratio $n\gamma/\beta$ has been written as $\rho$.

Observe that: $\dfrac{dy}{dx} > 0$ when $x < \rho$; $\dfrac{dy}{dx} = 0$ when $x = \rho$; $\dfrac{dy}{dx} < 0$ when $x > \rho$.


Question 3.1

(i) Show that the general solution $y$ to the differential equation (3.6) satisfies

$$x + y = \rho \log x + \text{constant} \quad (x > 0,\ y > 0). \tag{3.7}$$

(ii) Using the initial values $x_0$ and $y_0$, evaluate the constant term in Equation (3.7),
and hence find an expression relating $x$ and $y$, respectively the number of
susceptibles and infectives at time $t$ in the deterministic general epidemic. □




The expression

$$y_0 - y = \rho \log\left(\frac{x_0}{x}\right) - (x_0 - x) \tag{3.8}$$

describes the course or trajectory of the epidemic in the positive quadrant of the
$(x, y)$-plane. Its starting point ($t = 0$) is the point $(x_0, y_0)$, where it is assumed that
$x_0 > 0$ and $y_0 > 0$. Figure 3.1 shows a typical trajectory for the case in which
$x_0 > \rho$. The arrow indicates the direction of increasing time. That this is the shape
of the trajectory can be deduced from Equations (3.4a) and (3.4b). From Equation
(3.4a), for positive values of $x$ and $y$, $x$ decreases (from $x_0$) as $t$ increases from 0.
From Equation (3.4b), $dy/dt > 0$ when $x > \rho$; $dy/dt = 0$ when $x = \rho$; $dy/dt < 0$ when
$x < \rho$; so $y$ increases from $y_0$ to a maximum value, $y_{\max}$, when $x = \rho$, and then
decreases.

The curve in Figure 3.1 suggests that as $y$ approaches 0, $x$ approaches some
positive value. The epidemic is over when there are no infectives ($y = 0$) to spread
the disease (they have all been removed). As you will see in Question 3.2,
according to the deterministic formulation, there will always be some susceptible
individuals left, who have 'lost their chance' to catch the disease. The number left
is denoted by $x_\infty$. The subscript $\infty$ is used because it can be shown that $y = 0$
when $t = \infty$: $x_\infty$ is the value of $x$ corresponding to $y = 0$; see Figure 3.2. In fact,
substituting $y = 0$ in Equation (3.8) gives an equation for $x_\infty$.


Question 3.2 Use Equation (3.8) to write down an equation for the number of
susceptibles remaining ($x_\infty$) when there are no infectives left (that is, when
$y = 0$). □


You should have discovered from Question 3.2 that the number $x_\infty$ of remaining
susceptibles is found using the equation

$$\rho \log\left(\frac{x_0}{x_\infty}\right) - (x_0 - x_\infty) = y_0. \tag{3.9}$$

This equation turns out to have two solutions, both positive: one greater than $\rho$
and one less than $\rho$. One of these is $x_\infty$.

From Figure 3.2, for which $x_0 > \rho$, you can see that the trajectory of the epidemic
is horizontal when $x = \rho$. In this case, it follows from Equation (3.8), on setting
$x = \rho$, that the number of infectives in the community is at its maximum of

$$y_{\max} = y_0 + x_0 - \rho - \rho \log\left(\frac{x_0}{\rho}\right). \tag{3.10}$$

In this case, we have $x_\infty < \rho$.
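The trajectory (3.8) and the peak value (3.10) are simple to evaluate. The following is a small sketch with hypothetical function names; the parameter values in the usage example ($x_0 = 20$, $y_0 = 1$, $\rho = 10$) are chosen only for illustration.

```python
import math

def infectives_on_trajectory(x, x0, y0, rho):
    """Equation (3.8) rearranged to give y for a given number of susceptibles x."""
    return y0 + (x0 - x) - rho * math.log(x0 / x)

def peak_infectives(x0, y0, rho):
    """Equation (3.10), which applies when x0 > rho.

    When x0 <= rho the number of infectives never rises above y0,
    so y0 itself is returned in that case.
    """
    if x0 <= rho:
        return y0
    return y0 + x0 - rho - rho * math.log(x0 / rho)

if __name__ == "__main__":
    x0, y0, rho = 20, 1, 10
    print("y_max:", peak_infectives(x0, y0, rho))
    for x in (20, 15, 10, 5):
        print(x, infectives_on_trajectory(x, x0, y0, rho))
```

Evaluating the trajectory at $x = \rho$ reproduces the value given by (3.10), which is a useful check that the two expressions agree.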


On the other hand, if the initial number of susceptibles is $x_0 < \rho$, so that for all
subsequent $x$, $x < \rho$, then from Equation (3.4b) $y$ decreases from $y_0$ and the
trajectory of the epidemic is as shown in Figure 3.3: there is no peak of activity.
Again we have $x_\infty < \rho$.

Question 3.3 Sketch the trajectory of the general epidemic in the case $x_0 = \rho$. Is
$x_\infty$ less than $\rho$ in this case? □

In each case, it is the smaller solution of Equation (3.9), the one less than $\rho$, which
gives $x_\infty$. This equation can be rewritten, taking exponentials, as

$$x_\infty = x_0 \exp\left([x_\infty - (x_0 + y_0)]/\rho\right), \tag{3.11}$$

and to this can be applied the formula iteration method. Starting with a guess at
$x_\infty$ (something less than $\rho$: $x_{\infty,0} = 0$ will always do), successively better guesses can
be generated using the recurrence equation

$$x_{\infty,j+1} = x_0 \exp\left([x_{\infty,j} - (x_0 + y_0)]/\rho\right), \quad j = 0, 1, 2, \ldots.$$

Figure 3.1 The trajectory of the general epidemic, $x_0 > \rho$

Figure 3.2 The general epidemic, $x_0 > \rho$

Figure 3.3 The general epidemic, $x_0 < \rho$

Or use the Newton-Raphson
iteration method to find $x_\infty$, if you
prefer.



Example  3.2 

For a community of size 21 ($n = 20$) and where $y_0 = 1$, so $x_0 = 20$, some values of
$x_\infty$ for four different values of $\rho = n\gamma/\beta$ are given in Table 3.1. The iterative
procedure for when $\rho = 10$ is

$$x_{\infty,j+1} = 20 \exp\left((x_{\infty,j} - 21)/10\right).$$

If $x_{\infty,0}$ is taken to be zero, then $x_{\infty,1} = 20e^{-2.1} = 2.449$, $x_{\infty,2} = 3.129$, and so on.

The iterative procedure converges to $x_\infty = 3.46$.

The numbers in the table make sense: if $\rho$ is low, then either the recovery rate $\gamma$ is
low, or the rate of communication $\beta$ is high (or both). In any case, rather a lot of
individuals will catch the disease and so very few will survive uninfected. On the
other hand, a high value of the parameter $\rho$ corresponds to a quick recovery rate:
individuals recover from the disease before they are in a position to pass it on, so
many will survive uninfected. □
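The formula iteration method of Example 3.2 can be written as a short loop. The sketch below starts from $x_{\infty,0} = 0$ as suggested in the text; the function name, the stopping tolerance and the iteration limit are assumptions made for this illustration.

```python
import math

def final_susceptibles(x0, y0, rho, tol=1e-10, max_iter=10000):
    """Smaller root of Equation (3.11), found by formula iteration from 0."""
    x = 0.0
    for _ in range(max_iter):
        x_next = x0 * math.exp((x - (x0 + y0)) / rho)
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    raise RuntimeError("formula iteration did not converge")

if __name__ == "__main__":
    # The community of Example 3.2: x0 = 20, y0 = 1.
    for rho in (5, 10, 20, 40):
        print(rho, round(final_susceptibles(20, 1, rho), 3))
```

The printed values can be compared with the entries of Table 3.1. Convergence is slower when $x_\infty/\rho$ is close to 1, which is why a generous iteration limit is used.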

Question  3.4  Assume  that  a  general  epidemic  develops  in  a  closed  community 
where  initially  there  are  one  diseased  individual  and  30  well  individuals.  Any 
particular  member  of  the  community  meets  others  at  an  average  rate  of  5  per  day. 
If  an  infected  person  meets  somebody  who  is  susceptible  (that  is,  anybody  well), 
the  probability  that  the  disease  is  transmitted  is  0.08.  If  a  person  does  catch  the 
disease,  then  he  or  she  becomes  infectious  immediately,  remaining  so  for  a  time 
which  is  exponentially  distributed  with  mean  6  days.  Then  he  is  considered  to 
have  recovered  from  the  disease  and  to  be  immune  from  further  attacks. 

(Hint:  If  you  have  forgotten  how  to  incorporate  the  probability  0.08  into  the 
model,  look  back  at  Solution  2.2.) 

(i) Write down the numbers $x_0$, $y_0$, $z_0$ and $n$, the parameters $\beta$ and $\gamma$, and the
number $\rho$.

(ii)  When  the  number  of  infectives  is  at  its  greatest,  how  many  susceptibles  are 
left?  How  many  infectives  are  there?  How  many  have  caught  the  disease  and 
have  already  recovered  from  it? 

(iii)  Use  the  formula  iteration  method  to  determine  how  many  people  of  the 
original  31  have  caught  the  disease  by  the  end  of  the  epidemic. 

(iv)  Sketch  the  trajectory  of  the  epidemic.  □ 

3.3  The  threshold  phenomenon 

An important aspect of the conclusions drawn from the analysis of the general
model may be expressed as the threshold phenomenon. The number ρ = nγ/β is a
threshold value for the initial number of susceptibles, as can be seen from
Figure 3.4.


Figure  3.4  The  threshold  phenomenon 

If ρ > x0 and a trace of infection is introduced into the population (y0 > 0), then
the infective number y(t) decreases monotonically and the epidemic quickly dies
out. (The case x0 = ρ is essentially the same as the case x0 < ρ.) If, however,
ρ < x0 and a trace of infection is introduced, then the infective number builds up,
a genuine epidemic outbreak occurs, and the number of susceptibles surviving
uninfected when the epidemic terminates is less than ρ.
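The threshold behaviour can be seen numerically with a crude Euler integration. The sketch below assumes the deterministic equations dx/dt = -βxy/n and dy/dt = βxy/n - γy (matching the transition rates βxy/n and γy of the general epidemic) and takes z0 = 0; the function name, step length and time horizon are choices of the sketch, not part of the unit.

```python
def deterministic_general_epidemic(x0, y0, beta, gamma, dt=0.001, t_max=60.0):
    """Euler integration of dx/dt = -beta*x*y/n, dy/dt = beta*x*y/n - gamma*y.

    Returns (final x, final y, largest value of y seen).
    Assumes z0 = 0, so that n = x0 + y0 - 1.
    """
    n = x0 + y0 - 1
    x, y = float(x0), float(y0)
    peak = y
    for _ in range(int(t_max / dt)):
        infections = beta * x * y / n * dt  # susceptibles infected in this step
        removals = gamma * y * dt           # infectives removed in this step
        x -= infections
        y += infections - removals
        peak = max(peak, y)
    return x, y, peak

# rho = n*gamma/beta: here n = 20, gamma = 1, so beta = 2 gives rho = 10 < x0
# and beta = 0.5 gives rho = 40 > x0.
print(deterministic_general_epidemic(20, 1, beta=2.0, gamma=1.0))
print(deterministic_general_epidemic(20, 1, beta=0.5, gamma=1.0))
```

In the first run the number of infectives rises above its starting value and fewer than ρ susceptibles survive; in the second the number of infectives never exceeds y0.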

Associated with the threshold phenomenon is the concept of 'herd immunity'. If, in
some population, the number of individuals susceptible to a disease is less than the
value of ρ (so that x0 < ρ), then the population as a whole is safe from epidemic
outbreaks of the disease. If an immunisation campaign is carried out, then it is not
necessary for the whole population to be rendered immune to the disease as long
as the number of susceptibles remaining is less than ρ. Public health control may
be aimed not only at reducing the number of susceptibles but also at increasing
the value of ρ by increasing γ, the removal rate, by reducing β, the communication
rate, or by reducing the probability that on contact the disease is transmitted.


Table 3.1 Numbers of susceptibles surviving uninfected out of 20

ρ       x∞
5       0.320
10      3.46
20      14.3
40      19.0


3.4  A  variation  on  the  model 


We  have  assumed  that  individuals  who  contract  a  disease  recover  from  it  after 
some  random  time.  Thereafter  they  are  not  capable  of  transmitting  the  disease,  or 
of  catching  it  a  second  time.  But  they  remain  in  the  community,  and  continue  to 
play  a  role  in  the  sequence  of  contacts  between  individuals. 


An alternative scenario (one of many) is that some random time after contracting
the disease, symptoms appear, and the affected individual is isolated; or perhaps
the disease is rather serious, and leads eventually to death. In either case, at the
end of the infectious period, the affected person is rather literally removed from the
community. This means that there are fewer left in it than there were. If at time t
there are x susceptibles in the community, y infectives and z = n + 1 - (x + y)
who have been isolated or who have died, then the corresponding infective contact
rate is

\[ \frac{\beta x y}{x + y - 1}. \]


Actually  the  deterministic  model  is  not  too  difficult  to  solve,  but  we  shall  not 
pursue  it.  We  now  turn  to  the  stochastic  version  of  the  general  epidemic  model 
with  recovery,  previously  described. 


4  The  stochastic  general  epidemic  model 


The main characteristics of the stochastic general epidemic model remain as they
were for the deterministic formulation: there are three categories of person in a
closed community of size n + 1: susceptibles, infectives and removals. The numbers
of each type at time t are denoted respectively by X(t), Y(t) and Z(t). As before, if
we want to count Z(t), the number of persons to have recovered from the disease
at time t, then we can obtain it by subtraction: Z(t) = n + 1 - (X(t) + Y(t)). In this
section we shall examine the bivariate random process

{(X(t), Y(t)); t ≥ 0},

where X(t) and Y(t) denote respectively the number of susceptibles and the number
of infectives at time t.

Remember that for this model, any individual who is infected necessarily passes
through the three stages

SUSCEPTIBLE → INFECTIVE → REMOVED.

This means that the random process {(X(t), Y(t))} changes state whenever one of
the two types of transition susceptible → infective or infective → removed
occurs. In the first case, when an infective meets a susceptible and the disease is
passed on, then the number of susceptibles decreases by one and the number of
infectives increases by one. In the second case (infective → removed), an
infective recovers from the disease, so the total number of infectives decreases by
one. (The number of removals increases by one: but we are not counting them.)

We have already derived in Equations (3.1a) and (3.1b) the two conditional
probability statements that characterize the general epidemic. They are repeated
here:

\[ P(X(t+\delta t) = x-1,\ Y(t+\delta t) = y+1 \mid X(t) = x,\ Y(t) = y) = \frac{\beta x y}{n}\,\delta t + o(\delta t) \tag{4.1a} \]

and

\[ P(X(t+\delta t) = x,\ Y(t+\delta t) = y-1 \mid X(t) = x,\ Y(t) = y) = \gamma y\,\delta t + o(\delta t). \tag{4.1b} \]


In the deterministic case, you saw that the epidemic terminates once y, the number
of infectives, reaches zero. The same happens here: once Y(t) = 0, there is nobody
left to transmit the disease, and nobody to recover from it. These are the only two
events which would constitute a transition from one state to another. The
community remains in the final state for ever after, composed merely of
individuals who have either never caught the disease, or who have caught it and
recovered.

(In fact, any realization of an epidemic would be over within finite time, but it is
convenient to use the notation (X∞, 0) to denote the final state of the random
process {(X(t), Y(t))}.)

A compact description of the general epidemic process may be given in terms of
transition rates, as in Table 4.1.


Table 4.1 Transition rates for the general epidemic

Transition                      Rate
(x, y) → (x - 1, y + 1)         βxy/n
(x, y) → (x, y - 1)             γy

The  same  information  is  shown  diagrammatically  in  Figure  4.1. 
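The transition rates translate directly into a simulation of one realization of the process. The sketch below is a standard event-by-event simulation (wait an exponential time with the total rate, then choose which transition occurs in proportion to its rate); the function name and argument list are illustrative assumptions, and it takes z0 = 0 and x0 + y0 ≥ 2.

```python
import random

def simulate_general_epidemic(x0, y0, beta, gamma, rng):
    """Simulate one realization; returns a list of (time, x, y) after each event.

    Assumes z0 = 0, so that n = x0 + y0 - 1, and n >= 1.
    """
    n = x0 + y0 - 1
    t, x, y = 0.0, x0, y0
    path = [(t, x, y)]
    while y > 0:                              # the epidemic ends when y = 0
        infection_rate = beta * x * y / n     # (x, y) -> (x - 1, y + 1)
        removal_rate = gamma * y              # (x, y) -> (x, y - 1)
        total = infection_rate + removal_rate
        t += rng.expovariate(total)           # exponential waiting time
        if rng.random() < infection_rate / total:
            x, y = x - 1, y + 1
        else:
            y -= 1
        path.append((t, x, y))
    return path

path = simulate_general_epidemic(3, 1, beta=1.8, gamma=1.2, rng=random.Random(1))
print(path[-1])  # final time, number of survivors, and y = 0
```

Repeating the simulation many times and recording the final value of x would give an empirical estimate of the survivor distribution discussed later in this section.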

It is possible to investigate the probabilities p_{x,y}(t) defined by

p_{x,y}(t) = P(X(t) = x, Y(t) = y)

(strictly, the conditional probabilities
p_{x,y}(t) = P(X(t) = x, Y(t) = y | X(0) = x0, Y(0) = y0)), and there are a number of
alternative methods for doing so. However, the formal
solutions obtained are algebraically very complicated and of little help in
developing an understanding of the behaviour of the model. No more will be said
in this course about the probability distribution over time of the bivariate random
process {(X(t), Y(t)); t ≥ 0}.

Instead, just as we did for the deterministic version of the model, we shall
concentrate on describing the path that the epidemic process maps out with
passing time in the (x, y)-plane and on the number of susceptibles who survive the
epidemic uninfected. In the deterministic model, that path was a curve which
started at (x0, y0) and ended at (x∞, 0). In the stochastic model, the path of the
epidemic is random; it takes values only over the integers, so it will not be
smooth; and the number of survivors X∞ is a random variable. The investigation
of the survivor distribution (the distribution of X∞) will be one of the main
concerns of this section.


Figure  4.1  Transition  rates  for  the 
general  epidemic 


4.1  Setting  up  the  model 

The path of the epidemic in the (x, y)-plane is the path taken by the discrete-time
two-dimensional Markov chain

{(X_j, Y_j); j = 0, 1, 2, ...}

embedded in the continuous-time epidemic process. This describes the state of the
process after the jth transition; we take X_0 = x0, Y_0 = y0. (And often, though not
necessarily, x0 = n, y0 = 1 and z0 = 0.)

For an epidemic currently in the state (x, y), where y > 0, the next state that it
moves to will be either (x - 1, y + 1) or (x, y - 1). The transition rates are given in
Table 4.1. It follows from considering their relative sizes,

\[ \frac{\beta xy/n}{\beta xy/n + \gamma y} = \frac{x}{\rho + x} \quad\text{and}\quad \frac{\gamma y}{\beta xy/n + \gamma y} = \frac{\rho}{\rho + x}, \]

where ρ is the quantity nγ/β as in the deterministic model, that the respective
probabilities of the two possible transitions are as given in Table 4.2.


Table 4.2 Transition probabilities for the general epidemic (y > 0)

Transition                      Probability
(x, y) → (x - 1, y + 1)         x/(ρ + x)
(x, y) → (x, y - 1)             ρ/(ρ + x)

These  transition  probabilities  are  also  shown  in  Figure  4.2. 

An  infection  corresponds  to  a  move  diagonally  upwards  and  to  the  left,  and  a 
removal  to  a  move  downwards.  Three  different  examples  of  realizations  of  an 
epidemic  are  shown  in  Figure  4.3.  In  each  case  the  epidemic  starts  at  x0  =  13, 
y0  =  2,  and  the  parameter  p  has  been  taken  to  be  equal  to  6.  The  epidemic 
terminates  when  y  =  0;  that  is,  when  the  path  hits  the  x-axis.  The  value  of  x  at 
this  point  is  the  number  of  susceptibles  from  the  original  13  who  have  survived  the 
epidemic  uninfected  (in  the  cases  shown,  11,  0  and  1).  Notice  that  the  path  could 
hit  the  y-axis  first,  in  which  case  all  the  remaining  infectives  are  gradually 
removed, and the epidemic ends at the point (0, 0). That is, Z∞ = n + 1: everybody
in the community has been afflicted with the disease and recovered from it. (This
cannot happen in the deterministic case of the model.)


Figure 4.2 Transition probabilities
for the general epidemic


Notice that the transitions are such that the path can never pass twice through the
same point. In addition, given (x0, y0), the path is restricted to values of (x, y) such
that x + y ≤ x0 + y0 and x ≤ x0.

We shall use the idea of the embedded Markov chain to find the survivor
distribution in a particular case. The example will be developed in the
audio-cassette session which follows.


Example  4.1 

Suppose, in a particular community of size 4, that one person is infected with a
disease: the other three are susceptible. Suppose that β = 1.8 and γ = 1.2.

Now  switch  on  the  tape. 


[Audio-cassette frames: Calculating the survivor distribution. The frames are
diagrams of states in the (x, y)-plane with blank boxes to be completed while
listening to the tape; only the following text is recoverable.]

x0 = 3, y0 = 1, z0 = 0, β = 1.8, γ = 1.2.

x0 + y0 + z0 = 4, so n = 3. So ρ = nγ/β = 2.

Indicate by arrows the possible transitions, and show their probabilities.
Complete the diagram, showing all possible transitions and their probabilities.

P[(3, 1) → (3, 0)] = ρ/(ρ + 3) = 2/5, so P(X∞ = 3) = 0.4.

P(X∞ = 0) = 1 - P(X∞ = 1 or 2 or 3).


You found during the audio-cassette session that the probability distribution of
X∞, the number of surviving uninfected susceptibles, is (to three decimal places) as
below.

x               0       1       2       3
P(X∞ = x)       0.294   0.156   0.150   0.400

The probability distribution is shown in Figure 4.4. Notice the shape of this
probability distribution. The most likely thing to happen (X∞ = 3) is that nobody
catches the disease at all: the single initial infective recovers without having
infected anybody else in the community. The next most likely thing to happen
(X∞ = 0) is precisely the opposite: that is, everybody catches the disease.

The distribution of X∞ is U-shaped. This phenomenon is related to the threshold
effect of the deterministic model. We shall be returning to this point in
Subsection 4.2.

Question  4.1  Consider  the  epidemic  of  Example  4.1. 

(i)  What  is  the  expected  number  of  survivors? 

(ii)  What  does  the  deterministic  model  give  for  the  number  of  surviving  uninfected 
susceptibles?  □ 

Example 4.1 demonstrates that even for small communities, calculation of the
survivor distribution can be rather involved. For general ρ, the survivor
distribution when x0 = 3, y0 = 1, is shown in Table 4.3.


Table 4.3 Survivor distribution, x0 = 3, y0 = 1

Number of survivors     Probability
0                       6(5ρ^3 + 12ρ^2 + 8ρ + 2) / ((ρ + 3)(ρ + 2)^2 (ρ + 1)^3)
1                       6ρ^3 (2ρ + 3) / ((ρ + 3)(ρ + 2)^2 (ρ + 1)^3)
2                       3ρ^2 / ((ρ + 3)(ρ + 2)^2)
3                       ρ / (ρ + 3)
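The entries of Table 4.3 can be evaluated exactly with rational arithmetic, which gives a quick check that the four probabilities sum to one and that ρ = 2 reproduces the distribution found in Example 4.1. The function name below is an assumption made for the sketch.

```python
from fractions import Fraction

def survivor_distribution_x0_3(rho):
    """Evaluate the four probabilities of Table 4.3 (x0 = 3, y0 = 1) exactly."""
    rho = Fraction(rho)
    denom = (rho + 3) * (rho + 2) ** 2 * (rho + 1) ** 3
    return {
        0: 6 * (5 * rho**3 + 12 * rho**2 + 8 * rho + 2) / denom,
        1: 6 * rho**3 * (2 * rho + 3) / denom,
        2: 3 * rho**2 / ((rho + 3) * (rho + 2) ** 2),
        3: rho / (rho + 3),
    }

dist = survivor_distribution_x0_3(2)          # Example 4.1 has rho = 2
for survivors, prob in dist.items():
    print(survivors, prob, round(float(prob), 3))
print("total:", sum(dist.values()))
```

Working with `Fraction` rather than floating-point numbers means the check that the probabilities total 1 is exact rather than approximate.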

Question  4.2 

(i) Show on a diagram like the one in Frame 2A the possible transitions and
associated probabilities for a general epidemic with x0 = 2 and y0 = 1 (and for
general ρ).

(ii) Find the survivor distribution.

(iii) Calculate explicitly the survivor distribution for the cases ρ = 1, 2, 10, and
comment on the shapes of the different distributions. □

4.2  The  threshold  phenomenon 

Figure 4.4 The probability distribution of X∞ (vertical axis P(X∞ = x); horizontal axis x = 0, 1, 2, 3)

(Margin note on Table 4.3: there is no need to check this.)

As is evident from Question 4.2 and Table 4.3, the algebraic complexity of the
expressions in the survivor distribution rapidly builds up as the initial number of
susceptibles increases. Formal general formulas for the probabilities have been
obtained, but we shall not deal with them in this course: more insight into the
behaviour of the model is gained from numerical calculations. It is a simple matter
to write a computer program that, for given ρ, will calculate the survivor
distribution

P(X∞ = x | X_0 = x0, Y_0 = y0), x = 0, 1, 2, ..., x0,

for any given pair (x0, y0). The results of such calculations for x0 = 20, y0 = 1, are
given in Table 4.4 for four different values of ρ, together with illustrative
probability diagrams in Figure 4.5 (on page 31). These results may be regarded as
the stochastic analogue of the survivor numbers x∞ for the deterministic model,
given in Table 3.1.
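One possible form for such a program is sketched below. It is not the program used to produce the unit's tables: it simply pushes probability through the embedded Markov chain using the transition probabilities x/(ρ + x) and ρ/(ρ + x) of Table 4.2, and the function name and the array layout are choices made for the sketch.

```python
def survivor_distribution(x0, y0, rho):
    """Return [P(X_inf = 0), ..., P(X_inf = x0)] for the embedded Markov chain."""
    total = x0 + y0
    # reach[x][y] = probability that the chain ever visits the state (x, y)
    reach = [[0.0] * (total + 2) for _ in range(x0 + 1)]
    reach[x0][y0] = 1.0
    # Visit states so that every state is handled after all states that feed it:
    # x decreasing (infections lower x), and within each x, y decreasing (removals).
    for x in range(x0, -1, -1):
        for y in range(total - x, 0, -1):
            p = reach[x][y]
            if p == 0.0:
                continue
            p_infection = x / (rho + x)
            if x > 0:
                reach[x - 1][y + 1] += p * p_infection   # (x, y) -> (x - 1, y + 1)
            reach[x][y - 1] += p * (1.0 - p_infection)   # (x, y) -> (x, y - 1)
    # The epidemic ends on the x-axis, at (x, 0)
    return [reach[x][0] for x in range(x0 + 1)]

print([round(p, 3) for p in survivor_distribution(3, 1, 2.0)])   # Example 4.1
print([round(p, 4) for p in survivor_distribution(20, 1, 5.0)])  # compare Table 4.4
```

Because the path can never pass twice through the same point, each state needs to be processed only once, and the whole calculation takes a number of steps proportional to the number of states.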


Table 4.4 Survivor probabilities for x0 = 20, y0 = 1

                        Probability
Survivors   ρ = 5     ρ = 10    ρ = 20    ρ = 40
 0          0.4825    0.0473    0.0003    0.0000
 1          0.1558    0.0707    0.0011    0.0000
 2          0.0529    0.0705    0.0027    0.0000
 3          0.0209    0.0602    0.0047    0.0000
 4          0.0096    0.0483    0.0070    0.0000
 5          0.0050    0.0378    0.0093    0.0001
 6          0.0030    0.0297    0.0114    0.0002
 7          0.0020    0.0237    0.0133    0.0004
 8          0.0015    0.0194    0.0149    0.0007
 9          0.0012    0.0164    0.0163    0.0013
10          0.0011    0.0144    0.0176    0.0020
11          0.0011    0.0131    0.0190    0.0032
12          0.0011    0.0125    0.0205    0.0048
13          0.0013    0.0125    0.0225    0.0072
14          0.0016    0.0133    0.0253    0.0107
15          0.0023    0.0151    0.0294    0.0159
16          0.0035    0.0184    0.0359    0.0244
17          0.0062    0.0250    0.0472    0.0393
18          0.0127    0.0391    0.0701    0.0698
19          0.0347    0.0793    0.1315    0.1532
20          0.2000    0.3333    0.5000    0.6667

When ρ = 5 and when ρ = 10, the survivor distribution is U-shaped: with high
probability there is either a large or a small number of survivors, but intermediate
numbers are unlikely. An epidemic either takes off (everybody, or nearly
everybody, catches the disease) or does not (hardly anybody at all catches the
disease).

It has been found from a large number of such calculations that, as a general rule,
U-shaped survivor distributions occur when x0 > ρ. If x0 < ρ, then, as illustrated
above, with high probability there is only a minor epidemic outbreak, but if x0 > ρ
then with high probability there is either a minor or a major outbreak, with
outbreaks of intermediate size unlikely.

As in the deterministic version of the model, there is a threshold phenomenon, with
ρ a threshold value for the initial number of susceptibles. However, the nature of
the threshold phenomenon is more complicated in the stochastic case, since when
x0 > ρ there is the possibility of a minor outbreak, a possibility not present in
the deterministic case.

To deepen our understanding of the threshold phenomenon, we shall consider an
approximation to the stochastic model during the initial stages of an epidemic
outbreak. Assuming that x0 is large, then during the initial stages while x does not
differ much from x0, we can write the transition rate of susceptibles to infectives
βxy/n as βx0y/n. Using this approximation, during the early stages of an epidemic
we have the approximate transition rates of Table 4.5.


Table 4.5 Approximate transition rates, early in an epidemic

Transition      Rate
y → y + 1       βx0y/n
y → y - 1       γy

These are the transition rates of a simple birth-death process, of the type
described in Unit 8, with individual birth rate βx0/n and death rate γ.

If βx0/n < γ (that is, if x0 < ρ) then the simple birth-death process is certain to
become extinct. If βx0/n > γ (that is, if x0 > ρ) then, from Unit 8, Equation (3.7),
the process becomes extinct with probability

\[ \left(\frac{\rho}{x_0}\right)^{y_0}, \tag{4.2a} \]

where y0 is, as before, the initial number of infectives. With probability

\[ 1 - \left(\frac{\rho}{x_0}\right)^{y_0} \tag{4.2b} \]

the birth-death process will explode. The event that the approximating infective
process becomes extinct corresponds to a minor epidemic outbreak, and the event
that the process explodes corresponds to a major outbreak. According to our
approximation,

for x0 > ρ, the probability of a minor outbreak is given by the expression
(4.2a) and the probability of a major outbreak by the expression (4.2b).

In  particular,  the  sum  of  the  probabilities  on  the  right-hand  side  of  a  U-shaped 
survivor  distribution  should  be  approximately  equal  to  the  value  of  the  expression 
(4.2a).  It  is  a  matter  of  subjective  judgement  how  many  terms  we  should  include  in 
the  right-hand  side  of  the  U-shape.  A  comparison  of  the  approximation  with  the 
sum  of  the  probabilities  on  the  right-hand  side  of  the  U-shape,  using  values  taken 
from  Table  4.4,  is  made  in  Table  4.6.  In  this  case  x0  =  20,  y0  =  1. 


Table 4.6 Probabilities of a minor outbreak

                                        ρ = 5     ρ = 10
Approximating probability (ρ/x0)^y0     0.250     0.500
Sum of 4 terms                          0.254     0.477
Sum of 5 terms                          0.257     0.495
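The comparison in Table 4.6 is easy to reproduce. The sketch below computes the approximating probability (ρ/x0)^y0 and sets it beside sums of the right-hand terms copied from Table 4.4; the function name is an assumption, and returning 1 when x0 ≤ ρ reflects the statement that the approximating birth-death process is then certain to become extinct.

```python
def minor_outbreak_probability(x0, y0, rho):
    """Approximate probability of a minor outbreak, from the birth-death approximation."""
    if x0 <= rho:
        return 1.0            # extinction of the approximating process is certain
    return (rho / x0) ** y0

# Right-hand tails of Table 4.4 (survivors = 20, 19, 18, 17, 16), x0 = 20, y0 = 1
tail = {
    5: [0.2000, 0.0347, 0.0127, 0.0062, 0.0035],
    10: [0.3333, 0.0793, 0.0391, 0.0250, 0.0184],
}

for rho, probs in tail.items():
    approx = minor_outbreak_probability(20, 1, rho)
    print(rho, approx, round(sum(probs[:4]), 3), round(sum(probs[:5]), 3))
```

The probability of a major outbreak under the same approximation is one minus the value returned.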

The  threshold  phenomenon  can  be  summarized  in  a  diagram  as  shown  in 
Figure  4.6. 


Figure  4.6  The  threshold  phenomenon 


Question  4.3  Consider  the  following  two  specifications  of  a  general  epidemic. 

(a) x0 = 20, y0 = 2, z0 = 0, β = 4.2, γ = 1.

(b) x0 = 222, y0 = 4, z0 = 0, β = 1, γ = 1.

In  which  case(s)  is  it  appropriate  to  apply  Expressions  (4.2a)  and  (4.2b)?  Where 
appropriate,  estimate  the  probability  of  a  minor  outbreak  and  of  a  major 
outbreak.  □ 

This  is  a  suitable  point  at  which  to  watch  Band  F  of  the  video-cassette. 




5  Other  models  for  epidemics 


In  this  unit  we  have  studied  two  basic  models— the  simple  epidemic  and  the 
general  epidemic.  Both  these  models  assume  that  the  epidemic  can  be  modelled  by 
a  Markov  process  in  continuous  time,  and  also  that  the  latent  period  is  zero. 
However,  for  many  diseases,  neither  of  these  assumptions  is  realistic,  so  other 
models  have  been  developed  in  the  study  of  epidemics. 

One  model  has  been  developed  to  attempt  to  describe  the  spread  of  an  infectious 
disease  for  which  the  latent  period  is  non-zero  but  varies  little  in  length  from  case 
to  case,  and  for  which  there  is  only  a  short  length  of  time  during  which  an  infected 
individual  can  pass  on  the  disease  before  removal,  usually  by  isolation.  This  model 
has  been  successfully  used  to  describe  the  spread  of  measles  through  families.  The 
incubation  period  is  about  10  to  14  days  and  the  disease  does  tend  to  occur  in  the 
children  of  a  family  at  intervals  of  about  this  length.  A  simplified  model  has  been 
developed  using  a  Markov  chain  where  the  (discrete-time)  random  process  is  the 
number  of  infectives,  counted  at  (say)  12-day  intervals.  So  this  model  assumes  that 
the  latent  period  is  constant  and  the  infectious  period  is  virtually  zero. 

The  models  which  have  been  developed  have  assumed  that  homogeneous  mixing 
occurs  within  the  community.  This  is  not  always  a  realistic  assumption,  and  more 
complicated  models  can  allow  for  other  mechanisms. 

Epidemic  models  have  been  developed  to  model  the  rate  at  which  a  disease 
spreads  spatially.  This  has  recently  been  applied  to  rabies,  a  typical  question  of 
interest  being:  if  there  is  an  outbreak  in  one  part  of  a  country,  how  long  a  time 
will  elapse  before  the  disease  travels  100  miles?  This  model  must  take  account  of 
the  behaviour  pattern  of  the  dogs  and  foxes  who  spread  the  disease. 

Other  diseases  are  spread  by  insect  carriers.  For  example,  malaria  is  spread  by 
single-celled  parasites  which  are  carried  by  mosquitoes,  and  people  are  infected 
through  being  bitten  by  infected  mosquitoes.  Models  for  the  spread  of  malaria 
must  take  account  of  the  life  cycle  of  mosquitoes. 

At  the  present  time  there  is  much  active  research  work  on  modelling  the  spread  of 
AIDS.  Features  which  must  be  covered  by  the  models  are,  for  example,  that  the 
incubation  period  can  be  very  long;  and  that  people  can  be  carriers  and  pass  on 
the  disease  without  knowing  that  they  have  it.  The  disease  is  so  new  that  many 
factors,  like  the  distribution  of  the  incubation  period  and  the  method  of  spread,  are 
not  known  completely.  Although  this  is  only  one  aspect  of  research  into  AIDS,  it 
is  very  important  that  estimates  of  the  total  epidemic  size  should  be  available. 

In  this  unit  there  has  been  room  for  only  an  introduction  to  the  study  of 
epidemics.  Research  workers  in  probability  are  continuing  this  work  and  are 
constantly  refining  models  to  increase  their  usefulness. 


Objectives 


After studying this unit, you should be able to:

calculate the epidemic parameter in a given situation;

use the deterministic version of the simple epidemic model to calculate the
progress of a disease through a community;

calculate the probabilities in the stochastic version of the simple epidemic
model when the size of the community is very small;

find the distribution of the duration of the stochastic simple epidemic when the
size of the community is very small;

calculate the mean and variance of the duration of the stochastic simple epidemic;

use the deterministic version of the general epidemic model to calculate survivor
numbers;

understand the significance of the size of the number ρ in plotting the trajectory of
the deterministic general epidemic;

use the Markov chain embedded in the stochastic version of the general epidemic
model to calculate probabilities for the survivor distribution in a general epidemic
when the size of the community is very small;

calculate approximations to the probabilities that a general epidemic will take off
or not, when x0 > ρ.


Appendix:  Solutions  to  questions 


Section  1 

1.1 (i) You contact the n other members of the community
according to a Poisson process at rate β. Assuming
homogeneous mixing, the probability that a particular
contact is Gary is 1/n. Therefore you meet Gary according to
a Poisson process at rate β/n.

(ii) The probability that a particular contact is with one of
the n + 1 - y well people is (n + 1 - y)/n. You come into
contact with well people, therefore, according to a Poisson
process at rate β(n + 1 - y)/n.

(iii) Each of the ill people meets well people at the same
average rate that you do: β(n + 1 - y)/n. There are y of you
altogether. In total, contacts between ill and well people
occur according to a Poisson process at rate βy(n + 1 - y)/n.


Section 2

2.1 This is really just Question 1.1 repeated. If at time t
there are y infectives and n + 1 - y susceptibles, and any
particular member of the community meets others at rate β,
then contacts between infectives and susceptibles occur at
rate β_y = βy(n + 1 - y)/n.

2.2 Of all the infective-susceptible contacts, only a
proportion θ lead to the disease being passed on. So what
are called 'infective contacts' occur as a Poisson process at
rate β_y = (θβ)y(n + 1 - y)/n. This refinement would not be
complicated to introduce into the model.

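2.3 (i) If

\[ \frac{dy}{dt} = \frac{\beta y(n+1-y)}{n}, \]

then separating the variables and integrating gives

\[ \int \frac{n}{y(n+1-y)}\,dy = \int \beta\,dt. \]

Now

\[ \frac{1}{y(n+1-y)} = \frac{1}{n+1}\left(\frac{1}{y} + \frac{1}{n+1-y}\right), \]

and so the equation can be rewritten as

\[ \frac{n}{n+1}\int\left(\frac{1}{y} + \frac{1}{n+1-y}\right)dy = \int \beta\,dt. \]

This gives

\[ \frac{n}{n+1}\bigl(\log y - \log(n+1-y)\bigr) = \beta t + \text{constant}, \]

which simplifies to

\[ \frac{n}{n+1}\log\left(\frac{y}{n+1-y}\right) = \beta t + \text{constant}. \]

(ii) If y = 1 at time 0, then

\[ \frac{n}{n+1}\log\left(\frac{1}{n+1-1}\right) = 0 + \text{constant}, \]

so

\[ \text{constant} = \frac{n}{n+1}\log\frac{1}{n}. \]

The particular solution corresponding to y0 = 1 is therefore given by

\[ \frac{n}{n+1}\log\left(\frac{y}{n+1-y}\right) = \beta t + \frac{n}{n+1}\log\frac{1}{n} \]

or

\[ \frac{n}{n+1}\log\left(\frac{ny}{n+1-y}\right) = \beta t. \]

Taking exponentials, we have

\[ \frac{ny}{n+1-y} = \exp\left(\frac{n+1}{n}\,\beta t\right). \]

Solving this equation for y gives

\[ y = \frac{n+1}{n\exp\left(-\left(1+\frac{1}{n}\right)\beta t\right) + 1}. \]

2.4 If

\[ y = \frac{n+1}{n\exp\left(-\left(1+\frac{1}{n}\right)\beta t\right) + 1}, \]

then

\[ \frac{dy}{dt} = \frac{\beta(n+1)^2\exp\left(-\left(1+\frac{1}{n}\right)\beta t\right)}{\left(n\exp\left(-\left(1+\frac{1}{n}\right)\beta t\right) + 1\right)^2}. \]

(An alternative approach would be to use Equation (2.2),

\[ \frac{dy}{dt} = \frac{\beta y(n+1-y)}{n}, \]

then substitute for y on the right-hand side using Equation
(2.3), to obtain the same answer as above.)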

2.5 (i) The information given in the question is that y0 = 1,
n + 1 = 112 and β = 4 per day. Then according to the
deterministic model, the number of infectives at time t is,
from Equation (2.3),

\[ y = \frac{n+1}{n\exp\left(-\left(1+\frac{1}{n}\right)\beta t\right) + 1} = \frac{112}{111\exp(-4.036t) + 1}, \]

where t is measured in days. When t = 1, y = 37.8.
Approximately 38 inhabitants of the village are infected after
one day. After just two days, y = 108.2. Only about four of
the original susceptibles are still free of the disease.

(ii) The disease is spreading fastest when, from Equation
(2.5),

\[ t = \frac{111\log 111}{4 \times 112} = 1.17. \]

(iii) The whole village is infected when y = 112, which
occurs, according to the deterministic model, only after
infinite time. (But y = 111.9 after only 3 days.)
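The arithmetic of Solution 2.5 can be checked with a short script. The two function names are illustrative assumptions; the formulas are the solution y(t) quoted above and the time t = n log n / (β(n + 1)) at which y reaches (n + 1)/2, which is the value used in part (ii).

```python
import math

def infectives(t, n, beta):
    """Deterministic simple epidemic with y0 = 1: number of infectives at time t."""
    return (n + 1) / (n * math.exp(-(1 + 1 / n) * beta * t) + 1)

def time_of_fastest_spread(n, beta):
    """Time at which y = (n + 1)/2, where dy/dt is greatest."""
    return n * math.log(n) / (beta * (n + 1))

n, beta = 111, 4.0          # village of 112, beta = 4 per day
for day in (1, 2, 3):
    print(day, round(infectives(day, n, beta), 1))
print(round(time_of_fastest_spread(n, beta), 2))
```

Evaluating `infectives` at the time returned by `time_of_fastest_spread` gives 56, half the village, as the logistic shape of the curve suggests.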

2.6 The differential-difference equations for the general
birth and death process are of the form

\[ \frac{d}{dt}p_y(t) = \beta_{y-1}p_{y-1}(t) + \nu_{y+1}p_{y+1}(t) - (\beta_y + \nu_y)p_y(t), \quad y = 0, 1, 2, \ldots, \]

where β_{-1} = ν_0 = 0. In the case of the simple epidemic,
ν_y = 0 for y = 1, 2, ... as well, and the corresponding
differential-difference equations take the form of a general
birth process:

\[ \frac{d}{dt}p_y(t) = \beta_{y-1}p_{y-1}(t) - \beta_y p_y(t), \quad y = 0, 1, 2, \ldots, \]

where, in this case,

\[ \beta_y = \begin{cases} 0, & y = -1, \\ \dfrac{\beta y(n+1-y)}{n}, & y = 0, 1, \ldots, n+1, \\ 0, & y = n+2, n+3, \ldots \end{cases} \]

Since β_0 = 0, it follows that \( \frac{d}{dt}p_0(t) = 0 \). For
y = 1, 2, ..., n + 1 the equations are of the form

\[ \frac{d}{dt}p_y(t) = \frac{\beta(y-1)(n+2-y)}{n}\,p_{y-1}(t) - \frac{\beta y(n+1-y)}{n}\,p_y(t). \]

(The final equation in the sequence is

\[ \frac{d}{dt}p_{n+1}(t) = \beta p_n(t).) \]

2.7(i)  With  a  single  individual  infected  initially,  the 
differential  equations  necessary  to  find  the  probability 
distribution  of  Y(t)  are 
d 

^  Py(0  —  PyPy{t)  +  Py  -  1  Py  -  1  {t),  }'  =  1,  2,  3,  4, 
where 

0y(ra+l-y)  3y(4  -  y) 

ft - - - - 3— 

so  /),  =  3,  /?2  =  4,  /?3  =  3.  We  have  the  sequence  of  four 
differential  equations 


i  ., 
dtpM  = 

-3pi(r), 

(a) 

jtPM  = 

-4  p2(t)  +  3pi(t). 

(b) 

&-I  Si. 
? 

II 

-3  p3(t)  +  4  p2(0. 

(c) 

d 

3  p3(t). 

(d) 

Since  one  individual  is  infected  initially, 

Pi(0)=l,  p2(0)  =  0,  p3(0)  =  0,  p4(0)  =  0. 

(ii)  Solution  of  Equation  (a)  gives 
Pi(t)  =  constant  x  e~3t; 

and  using  the  initial  condition  pl(0)=  1,  it  follows  that  the 
constant  is  1,  so 


Pi(t) 


-  „-3 I 


Using  this  result  in  Equation  (b)  gives 
+  4p2(0  =  3e~3', 

which  is  of  the  form  dp/dt  +  A(t)p  =  B{t).  Multiplying  both 
sides  by  exp(j  4  dt)  =  gives 

^P2(0^4‘  +  4p2(t)e4'  =  3«*. 

This  can  be  rewritten  as 
^(p2We4')  =  3e*. 

Integrating  with  respect  to  f,  we  obtain 


^(p2(t)<?4')d*  = 


3er  dt, 


or 


p2(t)c4'  =  3e'  +  constant. 

From  the  initial  condition  p2(0)  =  0,  it  follows  that  the 
constant  is  —3;  so 

p2(t)e4'  =  3(e'  -  I), 
which  gives 

p2(t)  =  3<T3‘  -  3e~4t. 

We now set this result into Equation (c), which becomes

$$\frac{d}{dt}p_3(t) + 3p_3(t) = 12e^{-3t} - 12e^{-4t},$$

again of the form $dp/dt + A(t)p = B(t)$. Multiplying both
sides by $\exp\left(\int 3\,dt\right) = e^{3t}$ gives

$$\frac{d}{dt}p_3(t)\,e^{3t} + 3p_3(t)e^{3t} = 12 - 12e^{-t},$$

which may be rewritten as

$$\frac{d}{dt}\left(p_3(t)e^{3t}\right) = 12 - 12e^{-t}.$$

Integrating with respect to t, we obtain

$$p_3(t)e^{3t} = 12t + 12e^{-t} + \text{constant}.$$

From the initial condition $p_3(0) = 0$, it follows that the
constant is $-12$; so

$$p_3(t)e^{3t} = 12t + 12e^{-t} - 12,$$

which gives

$$p_3(t) = 12te^{-3t} - 12e^{-3t} + 12e^{-4t}.$$

Finally, setting this result into Equation (d), we have

$$\frac{d}{dt}p_4(t) = 36te^{-3t} - 36e^{-3t} + 36e^{-4t}.$$

By integration by parts,

$$\int 36te^{-3t}\,dt = 36t\left(-\tfrac13 e^{-3t}\right) - \int -12e^{-3t}\,dt = -12te^{-3t} - 4e^{-3t} = -4e^{-3t}(1 + 3t).$$

So integrating the equation for $p_4(t)$, we obtain

$$p_4(t) = -4e^{-3t}(1 + 3t) + 12e^{-3t} - 9e^{-4t} + \text{constant} = 8e^{-3t} - 12te^{-3t} - 9e^{-4t} + \text{constant},$$

and since $p_4(0) = 0$, it follows that the constant is 1. The
distribution of Y(t) is

$$p_1(t) = e^{-3t},$$

$$p_2(t) = 3e^{-3t} - 3e^{-4t},$$

$$p_3(t) = 12te^{-3t} - 12e^{-3t} + 12e^{-4t},$$

$$p_4(t) = 8e^{-3t} - 12te^{-3t} - 9e^{-4t} + 1.$$

(iii)  As $t \to \infty$, $p_1(t)$, $p_2(t)$ and $p_3(t)$ approach zero; $p_4(t) \to 1$.
Eventually, progress of the disease through the whole
household is certain.
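
As a numerical check on the algebra in Solution 2.7, the sketch below integrates Equations (a) to (d) with a classical fourth-order Runge-Kutta step and compares the result with the closed forms for $p_1(t), \ldots, p_4(t)$. The function names and the step count are illustrative choices, not part of the course text.

```python
import math


def closed_form(t):
    """Closed-form p_1(t)..p_4(t) as derived in Solution 2.7."""
    e3, e4 = math.exp(-3 * t), math.exp(-4 * t)
    return [
        e3,
        3 * e3 - 3 * e4,
        12 * t * e3 - 12 * e3 + 12 * e4,
        8 * e3 - 12 * t * e3 - 9 * e4 + 1,
    ]


def derivative(p):
    """Right-hand sides of Equations (a)-(d)."""
    p1, p2, p3, _ = p
    return [-3 * p1, -4 * p2 + 3 * p1, -3 * p3 + 4 * p2, 3 * p3]


def rk4(t_end, steps=2000):
    """Integrate from p(0) = (1, 0, 0, 0) to t_end with classical RK4."""
    p = [1.0, 0.0, 0.0, 0.0]
    h = t_end / steps
    for _ in range(steps):
        k1 = derivative(p)
        k2 = derivative([pi + 0.5 * h * k for pi, k in zip(p, k1)])
        k3 = derivative([pi + 0.5 * h * k for pi, k in zip(p, k2)])
        k4 = derivative([pi + h * k for pi, k in zip(p, k3)])
        p = [
            pi + h * (a + 2 * b + 2 * c + d) / 6
            for pi, a, b, c, d in zip(p, k1, k2, k3, k4)
        ]
    return p


for t in (0.1, 0.5, 1.0, 2.0):
    exact = closed_form(t)
    numeric = rk4(t)
    worst = max(abs(a - b) for a, b in zip(exact, numeric))
    print(t, [round(v, 4) for v in exact], "max difference:", worst)
```

The four closed-form probabilities should also sum to 1 at every t, which gives a quick independent check on the signs in $p_3(t)$ and $p_4(t)$.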

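2.8  The cumulative distribution function of W is

$$P(W \le t) = P(Y(t) = 4) = 8e^{-3t} - 12te^{-3t} - 9e^{-4t} + 1,$$

from Solution 2.7.

Hence

$$E(W) = \int_0^\infty P(W > t)\,dt = \int_0^\infty \left(12te^{-3t} + 9e^{-4t} - 8e^{-3t}\right)dt.$$

Now

$$\int_0^\infty \alpha te^{-\beta t}\,dt = \frac{\alpha}{\beta}\int_0^\infty \beta te^{-\beta t}\,dt = \frac{\alpha}{\beta^2}$$

immediately, since $\int_0^\infty \beta te^{-\beta t}\,dt$ is the expectation of an
exponential distribution with parameter $\beta$. We have
already seen that $\int_0^\infty \alpha e^{-\beta t}\,dt = \alpha/\beta$. So

$$E(W) = \tfrac{12}{9} + \tfrac{9}{4} - \tfrac{8}{3} = \tfrac{11}{12}.$$

As a check, using Result (2.12),

$$E(W) = \sum_{j=y_0}^{n}\frac{1}{\beta_j} = \sum_{j=1}^{3}\frac{1}{\beta_j} = \frac{1}{\beta_1} + \frac{1}{\beta_2} + \frac{1}{\beta_3}$$

$$= \tfrac13 + \tfrac14 + \tfrac13, \text{ using Solution 2.7,}$$

$$= \tfrac{11}{12}.$$

The variance of W is

$$V(W) = \sum_{j=y_0}^{n}\frac{1}{\beta_j^2} = \tfrac19 + \tfrac1{16} + \tfrac19 = \tfrac{41}{144}.$$

(It would be a very long process indeed to obtain this from
the probability density function of W.)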
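
The two sums in Solution 2.8 can be evaluated exactly with rational arithmetic. The sketch below assumes the rates $\beta_j = \beta j(n+1-j)/n$ of the simple epidemic; the function names are illustrative only.

```python
from fractions import Fraction


def transition_rates(n, beta):
    """beta_j = beta * j * (n + 1 - j) / n for j = 1..n (simple epidemic)."""
    return [Fraction(beta) * j * (n + 1 - j) / n for j in range(1, n + 1)]


def duration_moments(n, beta, y0=1):
    """Mean and variance of W as sums of 1/beta_j and 1/beta_j**2, j = y0..n."""
    rates = transition_rates(n, beta)[y0 - 1:]
    mean = sum(1 / b for b in rates)
    variance = sum(1 / b**2 for b in rates)
    return mean, variance


# Household of Solution 2.7: n = 3, beta = 3, one initial infective.
mean, variance = duration_moments(n=3, beta=3)
print(mean, variance)
```

Working in `Fraction` rather than floating point keeps the answers in the same form as the text, so they can be compared directly with 11/12 and 41/144.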




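2.9  The general result is

$$E(W) = \sum_{j=1}^{n}\frac{1}{\beta_j} = \sum_{j=1}^{n}\frac{n}{\beta j(n+1-j)} = \frac{n}{\beta}\sum_{j=1}^{n}\frac{1}{j(n+1-j)}.$$

Rewriting the fraction $1/[j(n+1-j)]$ as
$[1/j + 1/(n+1-j)]/(n+1)$, we have

$$E(W) = \frac{n}{(n+1)\beta}\sum_{j=1}^{n}\left(\frac1j + \frac{1}{n+1-j}\right)$$

$$= \frac{n}{(n+1)\beta}\left[\left(1 + \frac12 + \cdots + \frac1n\right) + \left(\frac1n + \frac{1}{n-1} + \cdots + \frac12 + 1\right)\right]$$

$$= \frac{2n}{(n+1)\beta}\sum_{j=1}^{n}\frac1j.$$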

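If $\beta = 1$, then

(i)  when n = 5,

$$E(W) = \tfrac{10}{6}\left(1 + \tfrac12 + \cdots + \tfrac15\right) = \tfrac{10}{6}(2.2833) = 3.806;$$

(ii)  when n = 10,

$$E(W) = \tfrac{20}{11}\left(1 + \tfrac12 + \cdots + \tfrac1{10}\right) = \tfrac{20}{11}(2.9290) = 5.325;$$

(iii)  when n = 20,

$$E(W) = \tfrac{40}{21}\left(1 + \tfrac12 + \cdots + \tfrac1{20}\right) = \tfrac{40}{21}(3.5977) = 6.853.$$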
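
The partial-fraction step in Solution 2.9 can be checked by computing $E(W)$ both ways: directly as the sum of $1/\beta_j$, and from the closed form involving the harmonic sum. The sketch below does this in exact arithmetic; the function names are illustrative, and $\beta$ is assumed to be a positive integer so that `Fraction` can be built from integers.

```python
from fractions import Fraction


def mean_duration_direct(n, beta=1):
    """Sum of 1/beta_j with beta_j = beta * j * (n + 1 - j) / n."""
    return sum(Fraction(n, beta * j * (n + 1 - j)) for j in range(1, n + 1))


def mean_duration_harmonic(n, beta=1):
    """Closed form 2n / ((n + 1) beta) * (1 + 1/2 + ... + 1/n)."""
    harmonic = sum(Fraction(1, j) for j in range(1, n + 1))
    return Fraction(2 * n, (n + 1) * beta) * harmonic


for n in (5, 10, 20):
    direct = mean_duration_direct(n)
    closed = mean_duration_harmonic(n)
    print(n, direct == closed, round(float(closed), 3))
```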


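2.10  Using

$$1 + \frac12 + \cdots + \frac1n \simeq \log(n+1) - \frac{1}{2(n+1)} + 0.5772,$$

then

(i)  when n = 5,

$$E(W) \simeq \tfrac{10}{6}\left(\log 6 - \tfrac{1}{12} + 0.5772\right) = 3.809;$$

(ii)  when n = 10,

$$E(W) \simeq \tfrac{20}{11}\left(\log 11 - \tfrac{1}{22} + 0.5772\right) = 5.327;$$

(iii)  when n = 20,

$$E(W) \simeq \tfrac{40}{21}\left(\log 21 - \tfrac{1}{42} + 0.5772\right) = 6.853.$$

The first two approximations are very good; the third is
better, being correct to the third decimal place.

(iv)  When n = 200,

$$E(W) \simeq \tfrac{400}{201}\left(\log 201 - \tfrac{1}{402} + 0.5772\right) = 11.70.$$

(v)  When n = 2000,

$$E(W) \simeq \tfrac{4000}{2001}\left(\log 2001 - \tfrac{1}{4002} + 0.5772\right) = 16.35.$$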
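
To see how the logarithmic approximation of Solution 2.10 behaves as n grows, the sketch below evaluates the mean duration with the exact harmonic sum and with the approximation side by side, taking $\beta = 1$ and natural logarithms. The constant 0.5772 is the value quoted in the solution; the function names are illustrative.

```python
import math

EULER_CONSTANT = 0.5772  # rounded value used in Solution 2.10


def harmonic_exact(n):
    """1 + 1/2 + ... + 1/n by direct summation."""
    return sum(1.0 / j for j in range(1, n + 1))


def harmonic_approx(n):
    """log(n + 1) - 1/(2(n + 1)) + 0.5772."""
    return math.log(n + 1) - 1.0 / (2 * (n + 1)) + EULER_CONSTANT


def mean_duration(n, harmonic, beta=1.0):
    """E(W) = 2n / ((n + 1) beta) * harmonic(n)."""
    return 2 * n / ((n + 1) * beta) * harmonic(n)


for n in (5, 10, 20, 200, 2000):
    exact = mean_duration(n, harmonic_exact)
    approx = mean_duration(n, harmonic_approx)
    print(n, round(exact, 3), round(approx, 3))
```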


Section  3 

3.1  (i)  Integrating with respect to x, the differential
equation

$$\frac{dy}{dx} = \frac{\rho - x}{x} = \frac{\rho}{x} - 1$$

gives

$$\int dy = \int\left(\frac{\rho}{x} - 1\right)dx;$$

hence,

$$y = \rho\log x - x + \text{constant}$$

or

$$x + y = \rho\log x + \text{constant}.$$

(ii)  Using the initial values $x_0$ and $y_0$, it follows that

$$x + y = \rho\log x + (x_0 + y_0 - \rho\log x_0),$$

which may be rewritten with terms in y on the left-hand side
and terms in x on the right as

$$y_0 - y = \rho\log x_0 - \rho\log x - (x_0 - x) = \rho\log\left(\frac{x_0}{x}\right) - (x_0 - x).$$


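3.2  If y = 0, then

$$y_0 - 0 = \rho\log\left(\frac{x_0}{x_\infty}\right) - (x_0 - x_\infty),$$

or

$$\rho\log\left(\frac{x_0}{x_\infty}\right) - (x_0 - x_\infty) = y_0.$$

(It is not possible to obtain an explicit formula for $x_\infty$ in
terms of the initial values $x_0$, $y_0$ and the parameter $\rho$.)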


3.3  In this case, $dy/dt = 0$ at $x = x_0 = \rho$ and, from Equations
(3.4a) and (3.4b), x and y decrease from $x_0$ and $y_0$,
respectively.

As in the other cases, $x_\infty < \rho$.


3.4(i)  The initial states for this epidemic are $x_0 = 30$,
$y_0 = 1$ and $z_0 = 0$. The value of n is 30. The contact rate is 5
per day; however, only a proportion 0.08 of these contacts are
infective contacts during which the disease is transmitted,
so $\beta = 5 \times 0.08 = 0.4$ per day. The infectious period has
mean $1/\gamma = 6$ days, so the recovery rate is $\gamma = \tfrac16$ per day. It
follows therefore that

$$\rho = \frac{n\gamma}{\beta} = \frac{30 \times \tfrac16}{0.4} = 12.5.$$


(ii)  The epidemic is at its height when the number of
susceptibles is $x = \rho = 12.5$, at which point (see Equation
(3.10)) the number of infectives is

$$y = y_0 + x_0 - \rho - \rho\log\left(\frac{x_0}{\rho}\right) = 1 + 30 - 12.5 - 12.5\log\left(\frac{30}{12.5}\right) = 7.56.$$

The number who have caught the disease but at this stage
have recovered from it is

$$z = n + 1 - x - y = 31 - 12.5 - 7.56 = 10.94.$$
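
The arithmetic of parts (i) and (ii) can be reproduced in a few lines. The sketch below assumes $x_0 > \rho$, so that the number of infectives does reach a maximum at $x = \rho$, and uses natural logarithms; the function names are illustrative.

```python
import math


def threshold(n, beta, gamma):
    """Relative removal rate rho = n * gamma / beta."""
    return n * gamma / beta


def infectives_at_peak(x0, y0, rho):
    """Number of infectives when x = rho, valid only when x0 > rho."""
    return y0 + x0 - rho - rho * math.log(x0 / rho)


# Data of Solution 3.4: 5 contacts per day, 8% infective, 6-day infectious period.
rho = threshold(n=30, beta=5 * 0.08, gamma=1 / 6)
y_peak = infectives_at_peak(x0=30, y0=1, rho=rho)
z_peak = 30 + 1 - rho - y_peak  # removed so far: n + 1 - x - y
print(round(rho, 2), round(y_peak, 2), round(z_peak, 2))
```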


(iii)  The number of susceptibles surviving the epidemic
uninfected is $x_\infty$, the smaller solution of the equation

$$x = x_0 e^{[x - (x_0 + y_0)]/\rho} = 30e^{(x-31)/12.5}.$$

Taking $x_{\infty,0} = 0$ as an initial guess, the formula iteration
method gives the following.

$$\begin{array}{c|ccccccccc}
j & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ \hline
x_{\infty,j} & 0 & 2.512 & 3.072 & 3.212 & 3.248 & 3.258 & 3.260 & 3.261 & 3.261
\end{array}$$

If, according to the deterministic model, 3.26 remain
uninfected, the remaining 27.74 must have caught the disease.
(One would round to the nearest integer when reporting the
conclusions of the model, as one cannot really talk about
fractional people.)
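
The formula iteration of part (iii) is easy to reproduce. The sketch below repeatedly applies $x \leftarrow x_0 e^{[x - (x_0 + y_0)]/\rho}$ from the starting guess 0 and records every iterate; the function name and the default number of steps are illustrative choices.

```python
import math


def final_size_iterates(x0, y0, rho, guess=0.0, steps=8):
    """Iterate x <- x0 * exp((x - (x0 + y0)) / rho); return all iterates."""
    iterates = [guess]
    for _ in range(steps):
        previous = iterates[-1]
        iterates.append(x0 * math.exp((previous - (x0 + y0)) / rho))
    return iterates


iterates = final_size_iterates(x0=30, y0=1, rho=12.5)
print([round(x, 3) for x in iterates])
```

Starting from 0 matters here: the equation has a second, larger root, and a guess below the smaller root makes the iteration settle on the one that represents the surviving susceptibles.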




(iv) 


Section  4 

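4.1  (i)  The expected number of survivors is

$$E(X_\infty) = 0 \times \tfrac{53}{180} + 1 \times \tfrac{7}{45} + 2 \times \tfrac{3}{20} + 3 \times \tfrac{2}{5} = \tfrac{7}{45} + \tfrac{3}{10} + \tfrac{6}{5} = 1.66.$$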


(ii)  In this case, $x_0 = 3$, $y_0 = 1$, $n = 3$, $\rho = 2$, and Equation
(3.11) gives

$$x = 3e^{(x-4)/2},$$

which has solution $x_\infty = 0.53$. The community is rather
small: the deterministic model has not provided a very clear
indicator of what is 'likely' to happen.

4.2(i)  The transition probability diagram for the case
$x_0 = 2$, $y_0 = 1$, looks as shown below.

(ii)  There is only one way of reaching the state (2, 0):

$$P((2,1) \to (2,0)) = \frac{\rho}{\rho+2}.$$

There is only one way of reaching the state (1, 0):

$$P((2,1) \to (1,2) \to (1,1) \to (1,0)) = \frac{2}{\rho+2} \times \frac{\rho}{\rho+1} \times \frac{\rho}{\rho+1} = \frac{2\rho^2}{(\rho+1)^2(\rho+2)}.$$

By subtraction, the probability that the epidemic terminates
at the origin is

$$1 - \frac{2\rho^2}{(\rho+1)^2(\rho+2)} - \frac{\rho}{\rho+2} = \frac{2(2\rho+1)}{(\rho+1)^2(\rho+2)}.$$

(There are two ways of reaching the origin,

$$P((2,1) \to (1,2) \to (0,3) \to (0,2) \to (0,1) \to (0,0)) = \frac{2}{(\rho+1)(\rho+2)},$$

$$P((2,1) \to (1,2) \to (1,1) \to (0,2) \to (0,1) \to (0,0)) = \frac{2\rho}{(\rho+1)^2(\rho+2)},$$

and the two probabilities sum to $2(2\rho+1)/[(\rho+1)^2(\rho+2)]$.)
The survivor distribution is as follows.

$$\begin{array}{c|ccc}
x & 0 & 1 & 2 \\ \hline
P(X_\infty = x) & \dfrac{2(2\rho+1)}{(\rho+1)^2(\rho+2)} & \dfrac{2\rho^2}{(\rho+1)^2(\rho+2)} & \dfrac{\rho}{\rho+2}
\end{array}$$

(iii)  Corresponding to the special cases $\rho$ = 1, 2, 10 are the
survivor distributions below.

$$\begin{array}{l|ccc}
x & 0 & 1 & 2 \\ \hline
P(X_\infty = x),\ \rho = 1 & 0.500 & 0.167 & 0.333 \\
P(X_\infty = x),\ \rho = 2 & 0.278 & 0.222 & 0.500 \\
P(X_\infty = x),\ \rho = 10 & 0.029 & 0.138 & 0.833
\end{array}$$

If $\rho = 1$, the distribution is markedly U-shaped, less so if
$\rho = 2$. When $\rho = 10$, the distribution is J-shaped.
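
The path-counting argument of Solution 4.2 becomes laborious for larger households, but the same survivor distribution can be built up mechanically. The sketch below assumes the embedded jump chain used in that solution: from state (x, y) with y > 0, an infection leads to (x - 1, y + 1) with probability $x/(\rho + x)$ and a removal leads to (x, y - 1) with probability $\rho/(\rho + x)$. The function name and the dictionary layout are illustrative.

```python
from fractions import Fraction


def survivor_distribution(x0, y0, rho):
    """Return {x: P(X_inf = x)} for the embedded jump chain described above."""
    rho = Fraction(rho)
    reach = {(x0, y0): Fraction(1)}  # probability of ever visiting each state
    final = {}
    # Visit x from x0 down to 0 and, within each x, y from high to low, so
    # that every state has its full probability before it is expanded.
    for x in range(x0, -1, -1):
        for y in range(x0 + y0 - x, 0, -1):
            prob = reach.get((x, y), Fraction(0))
            if prob == 0:
                continue
            infect = Fraction(x) / (rho + x)
            if x > 0:
                key = (x - 1, y + 1)
                reach[key] = reach.get(key, Fraction(0)) + prob * infect
            if y > 1:
                key = (x, y - 1)
                reach[key] = reach.get(key, Fraction(0)) + prob * (1 - infect)
            else:
                # Removal of the last infective ends the epidemic at (x, 0).
                final[x] = final.get(x, Fraction(0)) + prob * (1 - infect)
    return final


for rho in (1, 2, 10):
    dist = survivor_distribution(2, 1, rho)
    print(rho, [round(float(dist[x]), 3) for x in (0, 1, 2)])

# Case of Solution 4.1: x0 = 3, y0 = 1, rho = 2.
dist = survivor_distribution(3, 1, 2)
print(float(sum(x * p for x, p in dist.items())))
```

For $x_0 = 3$, $y_0 = 1$, $\rho = 2$ the mean of the resulting distribution can be compared with the expected number of survivors, 1.66, quoted in Solution 4.1.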


4.3(a)  In this case $\rho = \dfrac{n\gamma}{\beta} = \dfrac{21 \times 1}{4.2} = 5$; so Expressions
(4.2a) and (4.2b) do apply. Thus, the probability of a minor
outbreak is approximately

$$\left(\tfrac14\right)^2 = 0.0625$$

and the probability of a major outbreak is approximately

$$1 - \left(\tfrac14\right)^2 = 0.9375.$$

(The sum of four terms on the left-hand side of the survivor
distribution is actually 0.9057.)

(b)  Here $\rho = \dfrac{n\gamma}{\beta} = \dfrac{225 \times 1}{1} = 225 > x_0$; so Expressions
(4.2a) and (4.2b) do not apply.




